The instant application contains an electronically submitted Sequence Listing in ASCII text file format which is hereby incorporated by reference in its entirety.
The field of the invention generally relates to modified carrier proteins comprising one or more GlycoTags and the use of such modified carrier proteins in efficient O-linked glycosylation, for example using PglL.
Protein glycosylation is a common posttranslational modification in bacteria by which glycans are covalently attached to surface proteins, flagella, or pili, for example. [1]. Glycoproteins play roles in adhesion, stabilization of proteins against proteolysis, and evasion of the host immune response. [1]. Two protein glycosylation mechanisms are distinguished by the mode in which the glycans are transferred to proteins: one mechanism involves the transfer of carbohydrates directly from nucleotide-activated sugars to acceptor proteins (used in, e.g., protein O-glycosylation in the Golgi apparatus of eukaryotic cells and flagellin O-glycosylation in some bacteria). A second mechanism involves the preassembly of a polysaccharide onto a lipid-carrier (by glycosyltransferases) which is then transferred to a protein acceptor by an oligosaccharyltransferase (OTase). [1]. This second mechanism is used in, e.g., N-glycosylation in the endoplasmic reticulum of eukaryotic cells, the well-characterized N-linked glycosylation system of Campylobacter jejuni, and the more recently characterized O-linked glycosylation systems of Neisseria meningitidis, Neisseria gonococcus, and Pseudomonas aeruginosa. [1]. For O-linked glycosylation (O-glycosylation), glycans are generally attached to a serine or threonine residue on the protein acceptor. For N-linked glycosylation (N-glycosylation), glycans are generally attached to an asparagine residue on the protein acceptor. See generally [2].
The two best understood glycosylation systems are the C. jejuni N-linked glycosylation system and the Neisseria O-linked glycosylation system. [1], [3]. In these two systems, a polysaccharide (glycan donor) linked to an undecaprenyl pyrophosphate (UndPP) lipid-carrier is translocated (flipped) to the periplasm by a flippase. [2], [3]. In the periplasm, an oligosaccharyltransferase (OTase) transfers the glycan to a protein acceptor (pilin). [2], [3]. The OTase of C. jejuni (PglB) transfers the glycan to the asparagine (N) in the conserved pilin pentapeptide motif D/E-X1-N-X2-S/T (where X1 and X2 are any residues except proline). [4]. The OTase of N. meningitidis (NmPglL) transfers the glycan to Ser63 in the N. meningitidis pilin PilE sequence (“sequon”) (N)-SAVTEYYLNHGEWPGNNTSAGVATSSEIK-(C) (SEQ ID NO: 140, corresponding to residues 45-73 of mature N. meningitidis PilE sequence SEQ ID NO: 137). [1], [3], [5]. Until this disclosure, the pilin sequence onto which other OTases (from N. gonorrhoeae, N. lactamica, or N. shayeganii for example) transfer glycan was not known (see [6]).
Conjugate vaccines (comprising a carrier protein covalently linked to an immunogenic glycan) have been a successful approach for vaccination against a variety of bacterial infections. However, the chemical methods by which they are routinely produced are complex and comparatively inefficient ([4] at
For example, carrier proteins AcrA and EPA were N-glycosylated in E. coli using heterologous polysaccharide as glycan donors and C. jejuni PglB because AcrA and EPA were first modified to incorporate an appropriate periplasmic signal sequence and at least one copy of the PglB sequon sequence D/E-X1-N-X2-S/T (a “GlycoTag”). [4]; see also [7], [8], [9], [10], [11] (all of which are incorporated herein by reference in their entireties). The use of PglB-based bioconjugation production is limited because PglB only accepts certain sugar substrates: those containing an acetamido group at position C-2 of the reducing end and those that do not possess a β1, 4 linkage between the first two sugars (i.e., the linkage between sugars “S-2” and “S-1”, the first sugar (S-1) comprising the reducing end and S-2 being adjacent to S-1). [3], [12], [13].
To overcome this limitation of PglB-based systems and because Neisserial PglLs are “promiscuous” with respect to sugar substrates ([3]), an O-glycosylation system using the PglL OTase from Neisseria meningitidis has been the focus of recent work ([1], [14], [15], [16]; see also [6]).
For example, carrier proteins EPA, TTc, and CTB were O-glycosylated by N. meningitidis PglL in Shigella flexneri using polysaccharides which were endogenous to the Shigella flexneri host cell as glycan donors (“endogenous polysaccharide”) because each carrier protein was modified to incorporate a periplasmic signal sequence and one copy of the N. meningitidis PilE sequon sequence
But like its predecessor, the applicability of this NmPglL work is limited at least because only O-glycosylation by NmPglL paired with NmPilE sequon sequences was demonstrated and the system showed an unfortunate bias toward CTB as carrier protein (CTB was more effective than the desirable carrier protein, EPA). [3]; see also [5].
An array of PglL OTases and pilin sequons are needed that may be optimally paired for efficient O-glycosylation of a variety of carrier proteins, especially EPA, and at internal glycosylation sites.
In one aspect, the present invention is the first to describe certain pilin sequences and modified carrier proteins comprising them, optionally wherein the pilin sequence is O-glycosylated by an OTases from NmPglL or a homologue thereof (such as OTases from N. gonorrhoeae, N. lactamica, or N. shayeganii). In another aspect, the present invention provides efficient O-glycosylation of a variety of glycotagged carrier proteins, especially EPA, and with GlycoTags located at N-terminal, C-terminal, and/or internal carrier protein residues (internal GlycoTags being expected to improve conjugate characteristics such as stability over time).
Embodiments of the present invention include, but are not limited to:
The present invention provides modified carrier proteins incorporating one or more GlycoTag and their use for in vivo or in vitro bioconjugation.
To facilitate an understanding of the present invention, a number of terms and phrases are defined below. Alternate forms (tenses) of these terms and phrases are also encompassed herein. Unless otherwise noted, technical terms are used according to conventional usage. Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCR Publishers, Inc., 1995 (ISBN 1-56081-569-8).
“Comprise” (“comprising” or “comprises”) as used herein is open-ended and means “including, but not limited to.” “Having” is used herein as a synonym of comprising. It is understood that wherever embodiments are described herein with the language “comprising,” such embodiments encompass those described in terms of “consisting of” and/or “consisting essentially of”. “Comprises therein” or “comprising therein” means that the referenced molecule, amino acid sequence, or nucleotide sequence has incorporated within it a GlycoTag molecule, amino acid sequence or nucleotide sequence, respectively. With respect to, for example, a “carrier protein comprising therein a GlycoTag,” the nucleotide sequence encoding that carrier protein has, between the 5′ and 3′ ends, a nucleotide sequence encoding a GlycoTag, likewise the carrier protein amino acid sequence has, between the N- and C-terminus, a GlycoTag amino acid sequence.
As used in the description and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a GlycoTag” encompasses one or more GlycoTags.
“About” or “approximately” mean roughly, around, or in the regions of. The terms “about” or “approximately” further mean within an acceptable contextual error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured, i.e. the limitations of the measurement system or the degree of precision required for a particular purpose. When the terms “about” or “approximately” are used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. For example “between about 5.5 to 6.5 g/l” means the boundaries of the numerical range extend below 5.5 and above 6.5 so that the particular value in question achieves the same functional result as within the range. For example, “about” and “approximately” can mean within 1 or more than 1 standard deviation as per the practice in the art. Alternatively, “about” and “approximately” can mean a range of up to 20%, up to 10%, up to 5%, or up to 1% of a given value.
The term “and/or” as used in a phrase such as “A and/or B” is intended to include “A and B,” “A or B,” “A,” and “B.” Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
Unless specifically stated, a process comprising a step of mixing two or more components does not require any specific order of mixing. Components can be mixed in any order. Where there are three components then two components can be combined with each other, and then the combination may be combined with the third component, etc. Similarly, while steps of a method may be numbered (such as (1), (2), (3), etc. or (i), (ii), (iii)), the numbering of the steps does not itself mean that the steps must be performed in that order (i.e., step 1 then step 2 then step 3, etc.). In certain embodiments, the word “then” is used to specify the order of a method's steps.
“Essentially the same” herein means a high degree of similarity between at least two molecules (including structure or function) or numeric values such that one of skill in the art would consider the difference to be immaterial, negligible, and/or statistically insignificant. For example, a first polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule is “essentially the same” as a second polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule herein if the first has only immaterial differences in structure and function as compared to the second. “Essentially the same” herein encompasses “the same.”
An “effective amount” means an amount sufficient to cause the referenced effect or outcome. An “effective amount” can be determined empirically and in a routine manner using known techniques in relation to the stated purpose. In certain embodiments, a composition comprises an immunologically effective amount of an antigen, adjuvant, or both. In certain embodiments, an “effective amount” in the context of administering a therapy (e.g. an immunogenic composition or vaccine of the invention) to a subject refers to the amount of a therapy which has a prophylactic and/or therapeutic effect(s). In certain embodiments, an “effective amount” refers to the amount of a therapy which is sufficient to achieve one, two, three, four, or more of the following effects: (i) reduce or ameliorate the severity of a bacterial infection or symptom associated therewith; (ii) reduce the duration of a bacterial infection or symptom associated therewith; (iii) prevent the progression of a bacterial infection or symptom associated therewith; (iv) cause regression of a bacterial infection or symptom associated therewith; (v) prevent the development or onset of a bacterial infection, or symptom associated therewith; (vi) prevent the recurrence of a bacterial infection or symptom associated therewith; (vii) reduce organ failure associated with a bacterial infection; (viii) reduce hospitalization of a subject having a bacterial infection; (ix) reduce hospitalization length of a subject having a bacterial infection; (x) increase the survival of a subject with a bacterial infection; (xi) eliminate a bacterial infection in a subject; (xii) inhibit or reduce a bacterial replication in a subject; and/or (xiii) enhance or improve the prophylactic or therapeutic effect(s) of another therapy.
“Subject” refers to an animal, in particular a mammal such as a primate (e.g. human).
“Essentially free,” as in “essentially free from” or “essentially free of,” means comprising less than a detectable level of a referenced material or comprising only unavoidable levels of a referenced material (trace amounts).
The word “substantially” does not exclude “completely” e.g. a composition which is “substantially free” from Y may be completely free from Y. “Substantially pure” refers to material which is at least 50% pure (i.e., free from contaminants), at least 90% pure, at least 95% pure, at least 98% pure, or at least 99% pure.
As is conventional, the designation “NH2” or “N-” refers to the N-terminus of an amino acid sequence and the designation “COOH” or “C-” refers to the C-terminus of an amino acid sequence.
“Internal”, “Interior” as used herein with respect to a protein, residue, or amino acid sequence means located between the N-terminus and the C-terminus.
“Fragment” is a nucleotide or polypeptide comprising “n” consecutive nucleic acids or amino acids, respectively, of the reference sequence and wherein “n” is any integer that is less than the total number of amino acids in the reference sequence. In certain embodiments, “n” is any integer between 1 and 100. In this way, a “fragment thereof” of a hypothetical 100 residue long reference sequence (SeqX) may consist of any 1 to 99 consecutive amino acids of SeqX. In certain embodiments, a fragment consists of 10, 20, 30, 40 or 50 contiguous amino acids of the full length sequence. Fragments may be readily obtained by removing “n” consecutive amino acids from either or both of the N-terminus and C-terminus of the full length reference polypeptide sequence. Fragments may be readily obtained by removing “n” consecutive nucleic acids fom either or both of the 3′ and 5′ ends of the nucleotide sequence that encodes the full length reference polypeptide sequence. An “immunogenic fragment” as used herein consists of “n” consecutive amino acids of an antigen sequence and is capable of eliciting an antibody or immune response in a mammal. Fragments of a polypeptide, for example, can be produced using techniques known in the art, e.g. recombinantly, by proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a polypeptide can be generated by removing one or more nucleic acids from the 3′ or 5′ end (for a terminal fragment) or by removing one or more nucleic acids from both 3′ and 5′ ends (for an internal fragment) of a nucleotide sequence that encodes the polypeptide's full length amino acid sequence.
“Operably linked” or “operatively linked” means linked so as to be “operational”, for example, the configuration of polynucleotide sequences for recombinant protein expression. In certain embodiments, “operably linked” refers to the art-recognized positioning of, e.g., nucleic acid components such that the intended function (e.g., expression) is achieved. A person with ordinary skill in the art will recognize that under certain circumstances (e.g., a cleavage site or purification tag), two or more components “operably linked” together are not necessarily adjacent to each other in the nucleic acid or amino acid sequence (contiguously linked). A coding sequence that is “operably linked” to a “control sequence” (e.g., a promoter, enhancer, or IRES) is ligated in such a way that expression of the coding sequence is under the influence or control of the control sequence. A person with ordinary skill in the art will recognize that a variety of configurations are functional and encompassed.
“Recombinant” means artificial or synthetic. In certain embodiments, “recombinant” indicates the referenced amino acid, polypeptide, conjugate, antibody, nucleic acid, polynucleotide, vector, cell, composition, or molecule was made by an artificial combination of two or more molecules (e.g., heterologous nucleic acid or amino acid sequences). Such artificial combination includes, without limitation, chemical synthesis and genetic engineering techniques. In certain embodiments, a “recombinant polypeptide” refers to a polypeptide that has been made using recombinant nucleic acids (nucleic acids introduced into a host cell). In certain embodiments, a recombinant nucleic acid is not heterologous (e.g., wherein the recombinant nucleic acid is a second copy of a nucleic acid innately present within a host cell). A “transgene” herein means a polynucleotide introduced into a cell, therefore a transgene is recombinant.
“Mutant” and “Modified” are given their well-understood and customary meanings and at least signify that the referenced molecule is altered (structure and/or function) as compared to control (e.g., wild type molecule or its naturally occurring counterpart) under comparable conditions or signify that the referenced numeric value is altered (increased or decreased) as compared to that of control under comparable conditions.
“Conservative” amino acid substitutions or mutations refer to the interchangeability of residues having similar side chains, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids. However, as used herein, in some embodiments, conservative mutations do not include substitutions from a hydrophilic to hydrophilic, hydrophobic to hydrophobic, hydroxyl-containing to hydroxyl-containing, or small to small residue, if the conservative mutation can instead be a substitution from an aliphatic to an aliphatic, non-polar to non-polar, polar to polar, acidic to acidic, basic to basic, aromatic to aromatic, or constrained to constrained residue. Further, as used herein, A, V, L, or I can be conservatively mutated to either another aliphatic residue or to another non-polar residue. The table below shows exemplary conservative substitutions.
“Isolated” or “purified” herein means a polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule in a form not found in nature. This includes, for example, a polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule having been separated from host cell or organism (including crude extracts) or otherwise removed from its natural environment. In certain embodiments, an isolated or purified protein is a protein essentially free from all other polypeptides with which the protein is innately associated (or innately in contact with). For example, “isolated PglL” or “purified PglL” includes the recombinant PglL protein essentially free from other periplasmic polypeptides that the PglL protein would otherwise be associated with (in contact with) inside the host cell. For example, an “isolated O-glycosylated modified carrier protein” or “purified O-glycosylated modified carrier protein” may have been separated from un-O-glycosylated modified carrier protein (e.g., following in vitro conjugation steps). In certain embodiments, “isolated” or “purified” also means a protein is not bound to an antibody or antibody fragment. In certain embodiments, an isolated or purified protein does not include a collection of the protein's components (sub-parts). For example, wherein the protein is a complex of protein components, an “isolated/purified complex” may not include a collection of the complex's components (unbound to each other) obtained after, for example, application of sodium dodecyl sulfate (SDS) or 2-Mercaptoethanol (both of which break down the bonds between protein components in a complex).
A “Pharamaceutical-grade” or “pharmaceutically acceptable” polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule is isolated, purified, or otherwise formulated to be essentially free from impurities (e.g., essentially free from components (e.g., naturally occurring components) which are unacceptably toxic to a subject to which the polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule may be administered). A pharmaceutical-grade polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule is not a crude polypeptide, conjugate, antibody, polynucleotide, vector, cell, composition, or molecule.
“Homologue(s)” as used herein means two or more molecules that, despite originating from a different genus or species of organism and/or having divergent structure, have essentially the same function. To denote similar functionality herein, “PglL” or “PilE” may be used to refer to oligosaccharyltransferases or pilin, respectively, even if alternate designations are used in the art (for example, “PaPglL” herein encompasses the oligosaccharyltransferase referred to as “PglO” from Neisseria gonorrhoeae and that is a known homologue of N. meningitidis PglL ([16], [17]; see also [14], [18], [19])).
“Endogenous” as used herein means the referenced two or more polypeptides, conjugates, antibodies, polynucleotides, vectors, cells, compositions, or molecules originate from the same species of organism, or, in the case of a synthetic or recombinant polypeptide for example, consists essentially of the structure and function as those that originate from the same species of organism. With respect to PglL, for example, “endogenous” refers to the relationship of the subject PglL to the subject pilin (or GlycoTag therefrom) and means that they both originate from the same species of organism, or consist essentially of the structure and function as those that originate from the same species of organism. As an example, a Neisseria meningitidis PglL is “endogenous” to N. meningitidis PilE (and in this way, a PglL may be said to be “endogenous to” the referenced pilin). As a further example, a Neisseria meningitidis PglL is “endogenous to” N. meningitidis cells (especially control or wild type N. meningitidis cells).
“Heterologous” as used herein means the referenced two or more things are not associated with each other in nature. In certain embodiments, a protein is “heterologous” to a cell if a comparable naturally occurring cell (e.g., wild type cell under comparable conditions) would not produce that protein. In certain embodiments, a periplasmic signal sequence is “heterologous” to a protein (or to the protein's amino acid sequence) because the comparable naturally occurring protein (e.g., wild type protein) would not be operatively linked to that signal sequence.
“Nucleic acid,” “nucleotide,” “polynucleotide” is used to refer to ribonucleic acid (RNA), deoxyribonucleic acid (DNA), a polyribonucleotide molecule, or a polydeoxyribonucleotide molecule whether or not modified, unmodified, or synthetic. Thus, polynucleotides as defined herein may include single- and double-stranded DNA, DNA including single- and double-stranded regions, single- and double-stranded RNA, and RNA including single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or include single- and double-stranded regions. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are “polynucleotides” as that term is intended herein. DNAs or RNAs may be synthetic (including, without limitation, the nucleic acid subunits that together form the polynucleotide). Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritiated bases, are included within the term “polynucleotides” as defined herein. In general, the term “polynucleotide” embraces all chemically, enzymatically and/or metabolically modified forms of unmodified polynucleotides. Polynucleotides can be made by a variety of methods, including in vitro recombinant DNA-mediated techniques and by expression of DNAs in cells and organisms. Polynucleotides include genomic and plasmid nucleic acids. DNA includes, without limitation, genomic (nuclear) DNA having introns, e.g., as well as recombinant DNA such as cDNA (e.g., introns removed). RNA includes, without limitation, mRNA and tRNA. It is envisioned that codon optimization is utilized for any recombinant expression of a polynucleotide molecule of the present invention.
“Vector” refers to a vehicle by which nucleic acid molecules are contained and transferred from one environment to another or that facilitates the manipulation of a nucleic acid molecule. A vector may be, for example, a cloning vector, an expression vector, or a plasmid. Vectors include, for example, a BAC or a YAC vector. The term “expression vector” includes, without limitation, any vector, (e.g., a plasmid, cosmid or phage chromosome) containing a coding sequence suitable for expression by a cell (e.g., wherein the coding sequence is operatively linked to a transcriptional control element such as a promoter). A vector may comprise two or more nucleic acid molecules, in certain embodiments each of those two or more nucleic acid molecules comprises a nucleotide sequence that encodes a protein.
“Polypeptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. “Peptide” may be used to refer to a polymer of amino acids consisting of 1 to 50 amino acids. The polymer can be linear or branched, it can comprise modified amino acids, and it can be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified naturally or by intervention; for example, disulfide bond formation, glycosylation (except the O-glycosylation of modified carrier proteins), lipidation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, modification by non-naturally occurring amino acids, or any other manipulation or modification, such as conjugation with a labeling component. Also included within the definition are, for example, polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino acids, etc.), as well as other modifications known in the art.
A “Glycan” is a large carbohydrate molecule containing smaller sugar molecules and in certain embodiments herein refers to the oligosaccharide chain of a “glycoprotein” (a protein comprising glycan(s) covalently attached to amino acid side chains). “O-glycan” or “O-linked-glycan” is used herein to reference a glycan that is covalently attached to a serine or threonine residue of another molecule (i.e., the glycan is engaged in o-linked glycosylation). Glycans may be immunogenic. [3].
“Reducing end” of an oligosaccharide or polysaccharide is the monosaccharide with a free anomeric carbon that is not involved in a glycosidic bond and is thus capable of converting to the open-chain form. The first sugar (“S-1”) herein is that comprising the reducing end and the second sugar (“S-2”) is that which is adjacent to S-1. The S-2 sugar may be attached to the S-1 sugar by, for example, an α-(1→3), β-(1→3), β-(1→4), or α-(1→6) linkage (see [3]).
“Antigen” or “immunogen” herein refer to a substance, typically a protein or glycan, which is capable of inducing an immune response in a subject. In certain embodiments, an antigen is a protein (e.g., a glycoprotein) that is “immunologically active,” meaning that once administered to a subject (either directly or by administering to the subject a nucleotide sequence or vector that encodes the protein) it is able to evoke an immune response of the humoral and/or cellular type directed against that protein. “O-antigens” consist of repeats of an oligosaccharide unit (O-unit), which generally has between two and six sugar residues. [20]. O-antigens are components of the outer-membrane of gram-negative bacteria. [20]. In certain embodiments, the glycan is an O-antigen.
“Adjuvants” are non-antigen substances that enhance the induction, magnitude, and/or longevity of an antigen's immunological effect.
“Conjugation” references the coupling of carrier protein to saccharide (e.g., by covalent bond).
“Conjugate” herein means two or more molecules (e.g., proteins) which are attached to each other. The two or molecules are optionally recombinant molecules and/or are heterologous to each other. In certain embodiments, the conjugate comprises two or more molecules, the first being a carrier protein, for example a modified carrier protein, and the remaining one or more molecules being glycans covalently attached to a serine or threonine residue of the carrier protein. In certain embodiments, a conjugate comprises a glycosylated carrier protein, such as an O-glycosylated carrier protein, including an O-glycosylated modified carrier protein. A conjugate may be the result of chemical conjugation or in vitro conjugation (bioconjugation).
“Antibody” means an immunoglobulin molecule that recognizes and specifically binds to a target, such as a protein, polypeptide, peptide, carbohydrate, polynucleotide, lipid, or combinations of the foregoing through at least one antigen recognition site within the variable region of the immunoglobulin molecule. As used herein, the term “antibody” encompasses intact polyclonal antibodies, intact monoclonal antibodies, multispecific antibodies such as bispecific antibodies generated from at least two intact antibodies, chimeric antibodies, humanized antibodies, human antibodies, fusion proteins comprising an antibody, and any other modified immunoglobulin molecule so long as the antibodies exhibit the desired biological activity. An antibody can be of any the five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, or subclasses (isotypes) thereof (e.g. IgG1, IgG2, IgG3, IgG4, IgA1 and IgA2), based on the identity of their heavy-chain constant domains referred to as alpha, delta, epsilon, gamma, and mu, respectively. The different classes of immunoglobulins have different and well known subunit structures and three-dimensional configurations. Antibodies can be naked or conjugated to other molecules such as toxins, radioisotopes, etc. The term “antibody fragment” refers to a portion of an intact antibody. An “antigen-binding fragment” refers to a portion of an intact antibody that binds to an antigen. An antigen-binding fragment can contain the antigenic determining variable regions of an intact antibody. Examples of antibody fragments include, but are not limited to Fab, Fab′, F(ab′)2, and Fv fragments, linear antibodies, and single chain antibodies.
“Antibody response” means production of an anti-antigen antibody. “Inducing an antibody response” or “raising an antibody response” means stimulating in vivo the production of an anti-antigen antibody, e.g., an anti-O-antigen antibody or an anti-glycan-antibody.
“Identical” or percent “identity” as used in the context of two or more sequences is a reference to the number of nucleotides or amino acids which are the same over the entire length of the aligned sequence (for clarity, a conserved amino acid substitution in this context would not be “the same” but an analog of an amino acid, e.g., is “the same”). There are several known ways to calculate percent identity (see [21]). Unless stated otherwise, percentage identity “X” herein of a first amino acid sequence to a second sequence amino acid is calculated as (100×(Y/Z)), where “Y” is the number of “matches” (amino acid residues scored as identical matches in the alignment of the first and second sequences, as aligned by visual inspection or a particular sequence alignment program) and “Z” is the total number of aligned residues. Therefore, and unless stated otherwise, if the first amino acid sequence is shorter than the second amino acid sequence and percent identity is calculated over “the entire length of the sequence,” “Z” is equal to the length (in number of amino acids) of the first sequence.
The percent identity can be measured using sequence comparison software or algorithms or by visual inspection. Various algorithms and software are known in the art that can be used to obtain alignments of amino acid or nucleotide sequences. One such non-limiting example of a sequence alignment algorithm is the algorithm described in [22], as modified in [23], and incorporated into the NBLAST and XBLAST programs ([24]). In certain embodiments, Gapped BLAST can be used as described in [24]. BLAST-2, WU-BLAST-2 ([25], ALIGN, ALIGN-2 (Genentech, South San Francisco, Calif.) or Megalign (DNASTAR) are additional publicly available software programs that can be used to align sequences. In certain embodiments, the percent identity between two nucleotide sequences is determined using the GAP program in GCG software (e.g., using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 90 and a length weight of 1, 2, 3, 4, 5, or 6). In certain alternative embodiments, the GAP program in the GCG software package, which incorporates the algorithm of Needleman and Wunsch ([26]) can be used to determine the percent identity between two amino acid sequences (e.g., using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5). Alternatively, in certain embodiments, the percent identity between nucleotide or amino acid sequences is determined using the algorithm of Myers and Miller ([27]). For example, the percent identity can be determined using the ALIGN program (version 2.0) and using a PAM120 with residue table, a gap length penalty of 12 and a gap penalty of 4. Appropriate parameters for maximal alignment by particular alignment software can be determined by one skilled in the art. In certain embodiments, the default parameters of the alignment software are used.
As a non-limiting example, whether any particular polynucleotide or polypeptide has a certain percentage sequence identity (e.g., is at least 80% identical, at least 85% identical, at least 90% identical, and in some embodiments, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical) to a reference sequence can, be determined using known methods such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local homology algorithm of Smith and Waterman (Advances in Applied Mathematics 2: 482 489 (1981)) to find the best segment of homology between two sequences. When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed.
In some embodiments, two nucleic acids or polypeptides of the invention are substantially identical, meaning they have at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, and in some embodiments at least 95%, 96%, 97%, 98%, 99% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using a sequence comparison algorithm or by visual inspection. Identity can exist over a region of the sequences that is at least about 10, about 20, about 40-60 residues in length or any integral value there between, and can be over a longer region than 60-80 residues, for example, at least about 90-100 residues, and in some embodiments, the sequences are substantially identical “over the full length of” the sequences being compared, such as the coding region of a nucleotide sequence for example.
“Numbered with respect to”, “as compared to”, “numbered according to” is used herein to reference a location in an amino acid sequence while not being limited to that referenced amino acid sequence. It would therefore be understood, for example, that residue “128 numbered with respect to SEQ ID NO: 140” may encompass 129 of SEQ ID NO: 145 as well as 128 of SEQ ID NO: 163 (demonstrated below).
“Host cell” as used herein refers to a cell into which a molecule (usually a heterologous or non-native nucleic acid molecule) is, has been, or will be introduced. A host cell herein does not encompass a whole human organism (i.e., an “isolated host cell”).
PglL
Oligosaccharyltransferases (OSTs or OTases) are membrane-embedded enzymes that transfer oligosaccharides from a lipid carrier to a nascent protein (unlike glycosyltransferases in the cytoplasm, which assemble oligosaccharides by sequential action, OTases transfer glycan to protein en bloc [2]). O-linked glycosylation consists of the covalent attachment of a sugar molecule (a glycan) to a side-chain hydroxyl group of an amino acid residue (e.g. serine, or threonine) in the protein target (e.g., pilin). Pilin-glycosylation gene L (PglL) proteins from, for example Neisseria meningitidis, are OTases involved in O-linked glycosylation. In the periplasm of gram-negative bacteria, PglLs transfer the glycan from Und-PP-glycan to a pilin protein ([1]). Unlike PglB (N-glycosylation), PglL does not require a 2-acetamido group at position C-2 of the reducing end or a β1, 4 linkage between the first two sugars for activity and so is able to transfer virtually any glycan (Neisseria meningitidis PglL transfers, e.g., C. jejuni heptasaccharide, E. coli O7 antigen, E. coli K30 capsular structure, S. enterica O-antigen, and E. coli 016 peptidoglycan subunits to pilin in both E. coli and Salmonella cells) ([1], [3], [14], [16]). NmPglL and homologues thereof, such as PglL from Neisseria gonorrhoeae (called “PglO”, [6] and [19]) and PilO from Pseudomonas aeruginosa ([15]), are therefore substrate “promiscuous” (i.e., they have relaxed substrate specificity and so are able to transfer diverse oligo- and polysaccharides). [1] and [14] (per [3] and [16]). Neisseria meningitidis PglL (NmPglL) Homologues are described herein (see Examples) and known to the art: [17], [28], [18]).
“PglL OTase” herein encompasses Neisseria meningitidis PglL OTase as well as NmPglL OTase Homologues. Therefore, the term “PglL OTases” herein includes, for example, Neisseria meningitidis PglL (NmPglL) Oligosaccharyltransferase (OTase), Neisseria gonorrhoeae PglL (NgPglL) OTase, Neisseria lactamica 020-06 (NlPglL) OTase, Neisseria lactamica ATCC 23970 PglL (NlATCC23970PglL) OTase, and Neisseria gonorrhoeae F62 PglL (NgF62PglL) OTase.
“PglL Glycan Substrate”, “PglL Substrate” as used herein is a reference to a glycan which is transferrable by a PglL Otase (i.e., a glycan that is a substrate of PglL). See [1], [14], [29], [3], [16]. In certain embodiments, the PglL Glycan Substrate is attached to a lipid-carrier (“lipid-carrier-linked PglL Glycan Substrate”). In certain embodiments, the lipid-carrier is undecaprenol-pyrophosphate (UndPP), dolichol-pyrophosphate, or a synthetic equivalent thereof. In certain embodiments, the lipid-carrier is UndPP. In certain embodiments, the glycan is a “UndPP-linked PglL Substrate”. It is envisioned that a lipid-carrier-linked glycan is membrane-bound within a gram-negative host cell. A lipid-carrier-linked PglL Glycan Substrate being membrane bound may be said to be located “at the periplasm.” In certain embodiments, a NmPglL Glycan Substrate, a NgPglL Glycan Substrate, a NlPglL Glycan Substrate, or a NsPglL Glycan Substrate is specified. In certain embodiments, the PglL Glycan Substrate comprises a glycan having a reducing end of Glucose, Galactose, Galactofuranose, Rhamnose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, HexNAc, deoxy HexNAc, diNAcBac, or Pse. In certain embodiments the glycan is immunogenic (e.g., an “immunogenic PglL Glycan Substrate”). In certain embodiments the glycan is an O-antigen (e.g., a “PglL O-antigen Substrate”). See [1], [14], [29], [16], [30], [15].
Recombinant expression of a Neisserial PglL within a heterologous host cell is described herein and is known by the art (see [10], [8], [9], [29] (e.g., Table 1), [11], [1], [14], [5], [3], [31]; all incorporated herein by reference in their entireties).
Carrier Proteins
“Carrier protein” as used herein means a protein suitable for use as a carrier protein in the production of bioconjugate vaccines (e.g., [32]). “Carrier protein” as used herein is distinct from a “lipid carrier” (or “lipid-linked-carrier”), the latter of which include, without limitation, undecaprenyl-pyrophosphate (UndPP).
A “modified carrier protein” as used herein means a carrier protein that is altered (in one or more way) as compared to wild type (i.e., a “modified carrier protein” excludes a wild type pilin protein). A modified carrier protein includes, without limitation, a carrier protein incorporating one or more GlycoTag, purification tag, deletion (e.g., of at least a part of the transmembrane domain), insertion, and/or mutation (e.g., AcrA mutation(s) ([33]). In certain embodiments, the modified carrier protein is altered as compared to a control carrier protein (e.g., wild type) such that the modified carrier protein may be an “acceptor” of the PglL Glycan Substrate (i.e., accept the PglL Glycan Substrate directly from PglL without pilin intermediate). In certain embodiments, one such modified carrier protein is altered by comprising one or more GlycoTags. In certain embodiments, one such modified carrier protein comprises one or more GlycoTags at its N-terminus, C-terminus, and/or interior residues. For clarity, “a modified carrier protein comprising a carrier protein having one or more GlycoTags at its N-terminus and/or C-terminus” means “a modified carrier protein comprising a carrier protein operably linked to one or more GlycoTags at its N-terminus and/or C-temrinus.”
In certain embodiments, the modified carrier protein is covalently coupled to a glycan, either directly (e.g., via an O-linked glycosidic bond) or indirectly (e.g., via a linker), wherein the coupling is at one or more of the GlycoTags. In further embodiments, the glycan is a PglL Glycan Substrate. In certain embodiments, the modified carrier protein is coupled to a Shigella glycan (e.g. a Shigella sonnei glycan (such as S. sonnei O-antigen), or e.g. a Shigella flexneri glycan (such as Shigella flexneri 2a CPS), or a Shigella dysenteriae glycan). In certain embodiments, the modified carrier protein is coupled to a Streptococcus glycan (e.g. Streptococcus pneumoniae (such as Streptococcus pneumoniae sp. 12F CPS, S. pneumoniae sp. 8 CPS, S. pneumoniae sp. 14 CPS, S. pneumoniae sp. 23A CPS, S. pneumoniae sp. 33F CPS, or S. pneumoniae sp. 22A CPS)).
“O-glycosylated modified carrier protein” means the modified carrier protein is glycosylated and, in particular, is engaged in O-linked glycosylation (e.g., a modified carrier protein that is O-linked to a PglL Glycan Substrate).
An O-glycosylated modified carrier protein may be directly or indirectly attached to two or more distinct immunogenic glycans and, in this way, useful for inducing an immune or antibody response to the two or more immunogenic glycans (i.e., multivalent).
Exemplary carrier proteins include, without limitation, detoxified Exotoxin A of P. aeruginosa (“EPA”; see, e.g., [4]), CRM197, maltose binding protein (MBP), Diphtheria toxoid (DT), Tetanus toxoid (TT), Tetanus Toxin C fragment (TTc), detoxified hemolysin A of S. aureus, clumping factor A, clumping factor B, E. coli FirmH, E. coli FirmHC, E. coli heat labile enterotoxin, detoxified variants of E. coli heat labile enterotoxin, Cholera toxin B subunit (CTB), cholera toxin, detoxified variants of cholera toxin, E. coli Sat protein, the passenger domain of E. coli Sat protein, Streptococcus pneumoniae Pneumolysin and detoxified variants thereof, C. jejuni Acriflavine resistance protein A (CjAcrA), E. coli Acriflavine resistance protein A (EcAcrA), Pseudomonas aeruginosa PcrV protein (PcrV), C. jejuni natural glycoproteins, S. pneumoniae NOX, S. pneumoniae PspA, S. pneumoniae PcpA, S. pneumoniae PhtD, S. pneumoniae PhtE, S. pneumoniae ply (e.g. detoxified ply), S. pneumoniae LytB, Haemophilus influenzae protein D (PD). [34], [35], [36]. In certain embodiments, the carrier protein is selected from the group consisting of CTB, TT, TTc, DT, CRM197, EPA, EcAcrA, CjAcrA, and PcrV. In certain embodiments, the carrier protein is selected from the group consisting of EPA, EcAcrA, CjAcrA, and PcrV. In certain embodiments, the carrier protein is EPA. In certain embodiments, the carrier protein is EcAcrA.
In certain embodiments, the carrier protein is protein D from Haemophilus influenzae (PD), for example, protein D sequence from FIG. 9 of [37] (FIG. 9a and 9b together, 364 amino acids). Inclusion of this protein in the immunogenic composition may provide a level of protection against Haemophilus influenzae related otitis media ([38]). The Protein D may be used as a full length protein or as a fragment (for example, Protein D may be as described in [39]). For example, a protein D sequence may comprise (or consist) of the protein D fragment as described in [37] lacking the 19 N-terminal amino acids from
In an embodiment, the carrier protein is CRM197. CRM197 is a non-toxic form of the diphtheria toxin but is immunologically indistinguishable from the diphtheria toxin (DT). Genetically detoxified analogues of diphtheria toxin include CRM197 and other mutants described in U.S. Pat. Nos. 4,709,017, 5,843,711, 5,601,827, and 5,917,017. CRM197 is produced by C. diphtheriae infected by the nontoxigenic phase β197tox-created by nitrosoguanidine mutagenesis of the toxigenic carynephage b ([40]). The CRM197 protein has the same molecular weight as the diphtheria toxin but differs from it by a single base change in the structural gene. This leads to a glycine to glutamine change of amino acid at position 52 which makes fragment A unable to bind NAD and therefore non-toxic ([41], [42]).
In an embodiment, the carrier protein is Tetanus Toxoid (TT). Tetanus toxin is a single peptide of approximately 150 kDa, which consists of 1315 amino-acid residues. Tetanus-toxin may be cleaved by papain to yield two fragments; one of them, fragment C, is approximately 50 kDa. Fragment C of TT is described in [43].
In an embodiment, the carrier protein is dPly (detoxified pneumolysin). Pneumolysin (Ply) is a multifunctional toxin with a distinct cytolytic (hemolytic) and complement activation activities ([44]). The toxin is not secreted by pneumococci, but it is released upon lysis of pneumococci under the influence of autolysin. Its effects include e.g., the stimulation of the production of inflammatory cytokines by human monocytes, the inhibition of the beating of cilia on human respiratory epithelial, the decrease of bactericidal activity and migration of neutrophils, and in the lysis of red blood cells, which involves binding to cholesterol. Because it is a toxin, it needs to be detoxified (i.e., non-toxic to a human when provided at a dosage suitable for protection) before it can be administered in vivo. Expression and cloning of wild-type or native pneumolysin is known in the art. See, for example, [45], [46], and [47]. Detoxification of Ply can be conducted by chemical means, e.g., subject to formalin or glutaraldehyde treatment or a combination of both ([48], [49]). Such methods are known in the art for various toxins. Alternatively, Ply can be genetically detoxified (altered so that it is biologically inactive whilst still maintaining its immunogenic epitopes, e.g., [50], [51], and [52]. As used herein, it is understood that the term “Ply” encompasses mutated pneumolysin and detoxified pneumolysin (dPly) suitable for pharmaceutical use (i.e., non toxic).
Nucleic acids encoding the carrier protein can be introduced into a host cell for the production of a bioconjugate comprising a carrier protein. For use in in vivo bioconjugation within a gram-negative bacterium, carrier proteins are located within the periplasm. A carrier protein may be targeted to the periplasm by use of a periplasmic signal sequence. Periplasmic signal sequence structure and use (including their cleavage, codon optimization, and recombinant attachment to a heterologous protein) is known in the art. See, e.g., [53], [54], [5], and [34]. Codon optimization, generally, is also well known in the art and, unless stated otherwise (including Examples), it is envisioned that codon optimization is utilized for any recombinant expression of the present invention. See, e.g., [55], [56], [57], [58], [59] [60].
Signal sequences, including periplasmic signal sequences, are usually removed during translocation of the protein into, for example, the periplasm by signal peptidases (i.e., a mature protein is a protein from which at least the signal sequence has been removed). “Targeted to the periplasm” is used herein to acknowledge that signal sequences are usually removed. In this way, a protein which is “targeted to the periplasm” includes both the protein operably linked to the periplasmic signal sequence and the mature protein from which the periplasmic signal sequence has already been removed.
Periplasmic signal sequences are well known in the art. In certain embodiments, the periplasmic signal is that of Erwinia carotovorans pectatelyase B (pelB), E. coli outer membrane porin A (OmpA), E. coli disulfide oxidoreductase (DsbA), E. coli Tol-Pal cell envelop complex (TolB), E. coli maltose binding protein subunit (MalE), E. coli flagellin (Flgl), Heat-liable enterotoxin (LtIIb) (e.g., E. coli LtIlb), SipA (e.g., Streptococcus pyogenes SipA, Clostridium acidurici SipA, Bacillus amyloliquefaciens SipA), E. coli nickel-binding protein NikA (NikA), Bacillus sp. Endoxylanase (XynA), E. coli Heast Stable Enterotoxin II (STII), or E. coli alkaline phosphatase subunit (PhoA). [5]. In certain embodiments, the periplasmic signal sequence is PelB, OmpA, DsbA, TolB, or MalE. In certain embodiments, the periplasmic signal sequence is DsbA.
In certain embodiments, the carrier proteins comprise a “tag,” i.e., a sequence of amino acids that allows for the detection, isolation and/or identification of the carrier protein. For example, adding a tag to a carrier protein can be useful in the purification of that protein and, hence, the purification of a bioconjugate comprising the tagged carrier protein. Exemplary tags that can be used herein include, without limitation, histidine (HIS) tags (e.g., hexa histidine-tag, or 6×His-Tag), FLAG-TAG, and HA tags also strep tag, myc tag, or combinations thereof. In certain embodiments, the tags used herein are removable, e.g., removal by chemical agents or by enzymatic means, once they are no longer needed, such as after the protein has been purified.
A “purification tag” as used herein refers to a ligand that aids protein purification with, for example, size exclusion chromatography, ion exchange chromatography, and/or affinity chromatography. Purification tags and their use are well known to the art (see, e.g., [61], [62]) and may be, for example, poly-histidine (HIS), glutathione S-transferase (GST), c-Myc (Myc), hemagglutinin (HA), FLAG, or maltose binding protein (MBP). In certain embodiments, apurification tag is an epitope tag (which include, e.g., a histidine, FLAG, HA, Myc, V5, Green Fluorescent Protein (GFP), GSK, β-galactosidase (b-GAL), luciferase, Maltose Binding Protein (MBP), or Red Fluorescence Protein (RFP) tag). In certain embodiments, polypeptides are operably linked to one or more purification tags (including combinations of purification tags). A step of purifying, collecting, obtaining, or isolating a protein may therefore include size exclusion chromatography, ion exchange chromatography, or affinity chromatography. In certain embodiments, a step of purifying a modified carrier protein (or a conjugate comprising it), utilizes affinity chromatography and, for example, a σ28 affinity column or an affinity column comprising an antibody that binds the modified carrier protein or the conjugate comprising it (optionally by binding to the glycn). In a certain embodiment, a step of purifying a fusion protein comprising at least a modified carrier protein operably linked to a purification tag utilizes affinity chromatography and, for example, an affinity column that binds the purification tag.
GlycoTags
“GlycoTag” as used herein is a recombinant O-linked glycosylation site and consists of a fragment of a pilin amino acid sequence. In this way, the term “Glycotag” is used to refer to a recombinant amino acid sequence (i.e., separated from a wild type pilin) whereas “sequon” may be used to refer to that same sequence that is located within a wild type pilin (i.e., not separated from a wild type pilin).
The use of multiple GlycoTags within one carrier protein is envisioned (see Examples), optionally, multiple GlycoTags being adjacent to each other. Two or more GlycoTags may be separated by a “Amino Acid Linker” consisting of one or more amino acids, which can be, for example, one or more glycine ([63]), one or more serine, and/or combinations thereof (See [64]). An “amino acid linker” herein is a type of “linker”.
O-glycosylation efficiency of GlycoTags located at the N- or C-terminus of a carrier protein may be increased by flanking the GlycoTag (i.e., placing toward the N-terminus and/or toward the C-terminus of the GlycoTag) with one or more “Flanking Peptide” (a peptide comprising hydrophilic amino acids such as, for example, DPRNVGGDLD (residues 599-608 of SEQ ID NO: 1) or QPGKPPR (residues 628-634 of SEQ ID NO: 1)). [3]. Such Flanking Peptide may be adjacent to the GlycoTag (i.e., with no amino acids between the GlycoTag and the Flanking Peptide) or may have 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acids between it and the GlycoTag. An insertion of two or more Flanking Peptides can be used. Flanking Peptides can be used to increase the O-glycosylation efficiency of shorter GlycoTags, such as those having the sequence SEQ ID NO: 142, 147, 151, or 164 (all 12 amino acids long).
Hydrophilic amino acids herein include arginine (R), lysine (K), aspartic acid (D), glutamic acid (E), glutamine (Q), asparagine (N), histidine (H), serine (S), threonine (T), tyrosine (Y), cysteine (C), and tryptophan (W).
Glycans
A glycan is any sugar that can be transferred (e.g, covalently attached) to a carrier protein. A glycan comprises monosaccharides, oligosaccharides and polysaccharides. An oligosaccharide is a glycan having 2 to 10 monosaccharides. A polysaccharide is a glycan having greater than 10 monosaccharides. Polysaccharides can be selected from the group consisting of O-antigens, capsules, and exopolysaccharides.
Glycans for use with the present invention are PglL Otase substrates. [1], [14], [29], [16], [30], and [15]. In certain embodiments, the glycan is operably linked to a lipid-carrier. In certain embodiments, the glycan can be, but is not limited to, hexoses, N-acetyl derivatives of hexoses, oligosaccharides, and polysaccharides. In certain embodiments, the monosaccharide at the reducing end of the glycan is a hexose or an N-acetyl derivative of a hexose. In a certain embodiment, the glycan comprises a hexose monosaccharide at its reducing end such as glucose, galactose, rhamnose, arabinotol, fucose or mannose. In certain embodiments, the hexose monosaccharide at the reducing end is glucose or galactose. In certain embodiments, the reducing end of the glycan is an N-acetyl derivative of hexose. In general, N-acetyl derivatives of hexose (or “hexose monosaccharide derivatives”) comprise an acetamido group at position 2. In certain embodiments, N-acetyl derivatives of hexose is selected from N-acetylglucosamine (GlcNAc), N-acetylhexosamine (HexNAc), deoxy HexNAc, and 2,4-diacetamido-2,4,6-trideoxyhexose (DATDH), N-acetylfucoseamine (FucNAc), and N-acetylquinovosamine (QuiNAc). In certain embodiments, the N-acetyl derivative of hexose is selected from N-acetylglucosamine (GlcNAc), N-acetylgalactosamine (GalNAc), N-acetylfucoseamine (FucNAc), 2,4-diacetarnido-2,4,6-trideoxyhexose (DATDH), glyceramido-acetamido trideoxyhexose (GATDH), and N-acetylhexosamine (HexNAc). In certain embodiments, the glycan has a reducing end of N,N-diacetylbacillosamine (diNAcBac) or Pseudaminic acid (Pse). In certain embodiments, the glycan is one that has a reducing end of Glucose, Galactose, arabinotol, fucose, mannose, Galactofuranose, Rhamnose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, HexNAc, deoxy HexNAc, QuiNAc, diNAcBac, or Pse. In certain embodiments, the glycan is one that has a reducing end of Glucose, Galactose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, HexNAc, deoxy HexNAc, or diNAcBac. In certain embodiments, the glycan is one that has a reducing end of Glucose, Galactose, Galactofuranose, Rhamnose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, or diNAcBac. In certain embodiments, the glycan is one that has a reducing end of Glucose, Galactose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, or diNAcBac. In certain embodiments, the glycan is one that has a reducing end selected from the group consisting of DATDH, GlcNAc, GalNAc, FucNAc, Galactose, and Glucose. In certain embodiments, the glycan is one that has a reducing end GlcNAc, GalNAc, FucNAc, or Glucose. In certain embodiments, the glycan is one that has a S-2 to S-1 reducing end of Galactose-β1,4-Glucose; Glucuronic acid-β1,4-glucose; N-acetyl-fucosamine-α1,3-N-acetyl-galactosamine; Galactose-β1,4-glucose; Rhamnose-β1,4-glucose; Galactofuranose-β1,3-glucose; N-acetyl-altruronic acid-α1,3-4-amino-N-acetyl-fucosamine; or Rhamnose-β1,4-N-acetylgalactosamine.
In certain embodiments, the glycan is endogenous to a Neisseria, Shigella, Salmonella, Streptococcus, Escherichia, Pseudomonas, Yersinia, Campylobacter, or Heliobacter cell. In certain embodiments, the glycan is endogenous to a Shigella, Salmonella, Escherichia, or Pseudomonas cell. In certain embodiments, the glycan is endogenous to a Shigella flexneri, Salmonella paratyphi, Salmonella enterica, or E. coli cell. In certain embodiments, the glycan is from C. jejuni, N. meningitidis, P. aeruginosa, S. enterica LT2, or E. coli. See [3], [29], [1], [14].
In certain embodiments, the glycan is an immunogenic glycan (an antigen). In certain embodiments, the glycan is an O-antigen. In certain embodiments, the glycan is an immunogenic O-antigen endogenous to a Neisseria, Shigella, Salmonella, Streptococcus, Escherichia, Pseudomonas, Yersinia, Campylobacter, or Heliobacter cell. In further embodiments, the PglL Glycan Substrate is a Shigella sonnei glycan antigen e.g. S. sonnei O-antigen, a Shigella flexneri glycan antigen e.g. Shigella flexneri 2a CPS, a Shigella dysenteriae glycan antigen, a Streptococcus pneumoniae glycan antigen e.g. Streptococcus pneumoniae sp. 12F CPS, S. pneumoniae sp. 8 CPS, S. pneumoniae sp. 14 CPS, S. pneumoniae sp. 23A CPS, S. pneumoniae sp. 33F CPS, or S. pneumoniae sp. 22A CPS. In certain embodiments, the glycan is a Streptococcus pneumoniae glycan having a reducing end of Glucose, Galactose, arabinotol, fucose, mannose, Galactofuranose, Rhamnose, GlcNAc, GalNAc, FucNAc, DATDH, GATDH, HexNAc, deoxy HexNAc, QuiNAc, diNAcBac, or Pse. In certain embodiments, the glycan is a Streptococcus pneumoniae glycan is one that has a S-2 to S-1 reducing end of Galactose-β1,4-Glucose; Glucuronic acid-β1,4-glucose; N-acetyl-fucosamine-α1,3-N-acetyl-galactosamine; Galactose-β1,4-glucose; Rhamnose-β1,4-glucose; Galactofuranose-β1,3-glucose; N-acetyl-altruronic acid-α1,3-4-amino-N-acetyl-fucosamine; or Rhamnose-β1,4-N-acetylgalactosamine. The CP gene clusters of all 90 S. pneumoniae serotypes have been sequenced by Sanger Institute (http://WorldWideWeb(www).sanger.ac.uk/Projects/S_pneumoniae/CPS/). Sequences are provided in NCBI as Genbank CR931632-CR931722. The capsular biosynthetic genes of S. pneumoniae are further described in Serotype 23A from Streptococcus pneumoniae strain 1196/45 (serotype 23a) as NCBI GenBank accession number: CR931683.1. Serotype 23B from Streptococcus pneumoniae strain 1039/41 as NCBI GenBank accession number: CR931684.1. Serotype 23F from Streptococcus pneumoniae strain Dr. Melchior as NCBI GenBank accession number: CR931685.1.
In certain embodiments, the glycan is an S. sonnei O-antigen. In certain embodiments, the S. sonnei O-antigen consists of a wbgT protein, a wbgU protein, a wzx protein, a wzy protein, a wbgV protein, a wbgW protein, a wbgX protein, a wbgY protein, and a wbgZ protein. In certain embodiments, the S. sonnei O-antigen consists of a wbgT protein having at least 90% identity to SEQ ID NO: 108, a wbgU protein having at least 90% identity to SEQ ID NO: 109, a wzx protein having at least 90% identity to SEQ ID NO: 110, a wzy protein having at least 90% identity to SEQ ID NO: 111, a wbgV protein having at least 90% identity to SEQ ID NO: 112, a wbgW protein having at least 90% identity to SEQ ID NO: 113, a wbgX protein having at least 90% identity to SEQ ID NO: 114, a wbgY protein having at least 90% identity to SEQ ID NO: 115, and a wbgZ protein having at least 90% identity to SEQ ID NO: 116).
Applications Thereof
Conjugation
As provided herein, the modified carrier proteins can be used for bioconjugation. In certain embodiments, the modified carrier proteins can be used for in vivo bioconjugation within a gram-negative bacterial host cell. In certain embodiments, the modified carrier proteins can be used for conjugate production by incubating the modified carrier protein with a Neisserial PglL and a PglL glycan substrate, optionally in a suitable buffer.
In Vivo Bioconjugation
In certain embodiments, O-glycosylated modified carrier proteins are produced using in vivo methods and systems. In certain embodiments, an O-glycosylated modified carrier protein (or bioconjugate) is made and then isolated from the periplasm of the host cell. In vivo conjugation (“bioconjugation”) of the present invention utilizes known methodologies for recombinant protein expression within a gram-negative bacterial cell and isolation therefrom, including sequence selection and optimization, vector design, cloning plasmids, culturing parameters, and periplasmic purification techniques. See, e.g., [65], [3], [5], [7], [8], [9], [10], [11], [1], [14], [4], [63], and [31]. Methods of producing bioconjugates using host cells are described in, for example, [66] and [67]. Bioconjugation offers advantages over in vitro chemical conjugation in that bioconjugation requires less chemicals for manufacture and is more consistent in terms of the final product generated.
Gram-negative bacterial cells for use with the present invention include, but are not limited to, a cell from the genera Neisseria, Shigella, Salmonella, Escherichia, Pseudomonas, Yersinia, Campylobacter, Vibrio, Klebsiella, or Helicobacter. In certain embodiments, the host cell is selected from the group consisting of Neisseria, Shigella, Salmonella, Escherichia, Pseudomonas, Yersinia, Campylobacter, and Helicobacter cells. In certain embodiments, the host cell is selected from the group consisting of Shigella, Salmonella, and Escherichia cells. In an embodiment, the gram-negative bacterial cell is classified as a Neisseria ssp., Shigella ssp., Salmonella ssp., Escherichia ssp., Pseudomonas ssp., Yersinia ssp., Campylobacter ssp., Vibrio ssp., Klebsiella ssp., or Helicobacter ssp. cell. The gram-negative bacterial host cell may be classified as a Neisserial ssp. cell other than Neisseria elongata. In a further embodiment, the gram-negative bacterial cell is a Shigella flexneri, Salmonella paratyphi, Salmonella enterica, E. coli, or Pseudomonas aeruginosa cell. In an embodiment, the host cell is selected from the group consisting of Shigella flexneri, Salmonella paratyphi, and Escherichia coli cells. In certain embodiments, the host cell is a Vibrio cholerae cell. In certain embodiments, the host cell is an Escherichia coli cell. In an embodiment, the gram-negative bacterial cell originated from E. coli strain K12, Top10, W3110, CLM24, BL21, SCM6 or SCM7. In certain embodiments, the host cell is a Shigella flexneri cell. In certain embodiments, the host cell is a Salmonella enterica cell. In an embodiment, the gram-negative bacterial cell originated from S. enterica strain SL3261, SL3749, SL326iδwaaL, or SL3749. In certain embodiments, the host cell is a Salmonella paratyphi cell. In certain embodiments, the host cell is a Pseudomonas aeruginosa cell. See [10], [8], [9], [29] at e.g. Table 1 and [11]; [3], [31], [5], [1], [14].
In certain embodiments, the gram-negative bacterial cell is modified such that the cell's endogenous (periplasmic) O-antigen ligase (or “endogenous PglL homologue”) is reduced (deficient or “knockdown”) or knocked-out (KO) in expression or function as compared to control (e.g., wild type). In certain embodiments, “reduction of endogenous PglL homologue” or “the endogenous PglL homologue is reduced” is used to mean a reduction (e.g., a knockdown), which encompasses a knock-out, of the expression or function of the endogenous PglL homologue. In that way, a gram-negative bacterial cell of the present invention may be deficient in its endogenous PglL homologue. For example, the WaaL gene of E. coli and that of Salmonella enterica are functional homologues of N. meningitidis PglL ([17], [28], and [68]). It is therefore envisioned that, for example, an Escherichia or Salmonella host cell for use with the present invention is modified such that the expression or function of WaaL is at least reduced as compared to a control (optionally wild type) Escherichia or Salmonella cell under essentially the same conditions. In certain embodiments, the host cell's endogenous PglL gene (e.g., the waaL gene) has been replaced by a heterologous nucleotide sequence encoding an oligosaccharyltransferase. Techniques for knocking down or knocking out an endogenous PglL homologue are known and include, for example, mutation or deletion of the gene encoding the endogenous PglL homologue. See the Examples and, e.g., [3]; see also [18].
Host cells of the present invention may utilize endogenous or heterologous glycosyltransferases for sequential assembly of oligosaccharides in the cytosol (cytosolic glycosyltransferases). Such glycosyltransferases include, for example, Neisseria PglD, PglC, PglB/PglB2, and PglA shown at
“O-glycosylation Machinery” is used to collectively reference the molecules (e.g. glycosyltransferases, flippases, polymerases, oligosaccharyltransferases including gene clusters and organelles) and processes for O-glycosylation which are well known to the art. See, e.g., [69], [3], [5], [31], [10], [8], [9], [29], [11]. In certain embodiments, a gram-negative bacterial host cell comprises O-glycosylation machinery that are endogenous, heterologous, or combinations thereof, to the host cell. In a fcurther embodiment, a gram-negative bacterial host cell comprises O-glycosylation machinery with the proviso that the cell's endogenous PglL or PglL homologue is reduced as compared to control. In a fcurther embodiment, a gram-negative bacterial host cell comprises endogenous O-glycosylation machinery with the proviso that the cell's endogenous PglL or PglL homologue is reduced as compared to control. In a certain embodiment the E. coli or S. enterica gram-negative host cell comprises endogenous O-glycosylation machinery with the proviso that the cell's PglL homologue WaaL is reduced as compared to control.
Again, codon optimization is well known in the art and, unless stated otherwise (including Examples), it is envisioned that codon optimization is utilized for any recombinant expression of the present invention.
The expression of the transgenes of the present invention can be under the control of a transcription control element (TCE) which includes, for example, a promoter. In certain embodiments, the transgene is under the control of a constitutive promoter or of an inducible promoter, which initiates transcription only when exposed to some particular external stimulus, such as, without limitation, antibiotics such as tetracycline, hormones such as ecdysone, or heavy metals. The promoter can also be specific to a particular cell-type, tissue or organ. Many suitable promoters and enhancers are known in the art, and any such suitable promoter or enhancer may be used for expression of the transgenes of the invention. Promoters for use with the present invention are known and include, without limitation, ParaBAD, arabinose, tac-promoter (Ptac), and constitutive promoters (including native constitutive promoters) ([4]; see also [10], [8], [9], [29], [11]). In certain embodiments, the promoter is a ParaBAD or arabinose promoter.
The incorporation of a nucleic acid molecule into a gram-negative bacterial cell can be performed using any number of techniques known in the art, including those for stable transfection or transformation of a nucleic acid molecule or vector into a host cell. See the references cited above and the techniques listed and described in [70]. Recombinant nucleic acids can be introduced into the host cells of the invention using methods such as electroporation, chemical transformation by heat shock, natural transformation, phage transduction, and conjugation. In certain embodiments, recombinant nucleic acids are introduced into a host cell using a plasmid (e.g. the recombinant nucleic acids are expressed in the host cell by a plasmid such as an expression vector). In another embodiment, recombinant nucleic acids are introduced into a host cell using the method of insertion described in [71].
Gram-negative bacterial cells incorporating the glycosyltransferases, modified carrier proteins, PglL Otases, or PglL Glycan Substrates of this invention can be grown using various methods known in the art, for example, grown in a broth culture. The modified carrier proteins or O-glycosylated modified carrier proteins produced by the cells can be isolated using various methods known in the art, for example, lectin affinity chromatography ([1]).
An O-glycosylated modified carrier protein may be purified (to remove host cell impurities and unglycosylated carrier protein) and optionally characterized by techniques known in the art (see, e.g., [4], [72]; see also [10], [8], [9], [29], and [11]). Purification of a bioconjugate may be by cell lysis (including, e.g., one or more centrifugation steps) followed by one or more isolation steps (including, e.g., one or more chromatography steps or a combination of fractionation, differential solubility, centrifugation, and/or chromatography steps). Said one or more chromatographic steps may comprise ion exchange, anionic exchange, affinity, and/or sizing column chromatography, such as Ni2+ affinity chromatography and/or size exclusion chromatography. In a certain embodiment, one or more chromatographic steps comprises ion exchange chromatography. Therefore, one or more of the purified polypeptides may be operably linked to a tag (a purification tag). For example, affinity column IMAC (Immobilized metal ion affinity chromatography) may be used to bind the poly-histidine tag operably linked to the carrier protein, followed by anion exchange chromatography and size exclusion chromatography (SEC). For example, purification of a bioconjugate may be by osmotic shock extraction followed by anionic and/or size exclusion chromatography ([7]); or by osmotic shock extraction followed by Ni-NTA affinity and fluoroapatite chromatography ([4]).
In Vitro Conjugation
To produce O-glycosylated modified carrier proteins in vitro, the PglL OTase can be incubated with the modified carrier protein and PglL glycan substrate in, for example, a buffer. In certain embodiments, the bugger has a pH of approximately 6 to approximately 8. In one aspect, the buffer may be phosphate buffer saline. In another aspect, the buffer may be Tris-HCl 50 mM, having a pH of 7.5.
In certain embodiments, chemical conjugation using known protocols is used (e.g., [73], [74], [75]). Thereby, a glycan may be covalently linked (either directly or through a linker) to an amino acid residue of a modified carrier protein. “Directly linked” herein means that the two entities are connected via a chemical bond, for example a covalent bond. “Indirectly linked” herein means that the two entities are connected via a linking moiety (“linker”) (as opposed to a direct covalent bond). In certain embodiments the linking moiety is adipic acid dihydrazide. In certain embodiments, the PglL glycan substrate is covalently linked to a modified carrier protein (directly or via a linker) through a chemical linkage obtainable using a chemical conjugation method selected from the group consisting of carbodiimide chemistry, reductive animation, cyanylation chemistry (for example CDAP chemistry), maleimide chemistry, hydrazide chemistry, ester chemistry, and N-hydroxysuccinimide chemistry. Conjugates can be prepared by direct reductive amination methods as described in, [76], [77]. Other methods are described in [78], [79], [80]. The conjugation method may alternatively rely on activation of the glycan with 1-cyano-4-dimethylamino pyridinium tetrafluoroborate (CDAP) to form a cyanate ester. Such conjugates are described in [81], [82], [83]. See also [84].
The glycosylated protein (i.e., conjugate) can then be purified, and optionally characterized, by techniques known in to art (see, e.g., [4], [72]; see also [8], [9], [10], [11]).
Conjugates
The O-glycosylated modified carrier proteins of the present invention can be used as therapeutic agents for the treatment of a number of diseases where an effective amount of the O-glycosylated modified carrier protein is administered to a subject in need of such treatment. The O-glycosylated modified carrier proteins of the present invention can also be used as a vaccine or in an immunogenic composition for the prevention of a disease when an effective amount of the O-glycosylated modifiec carriier protein is administered to a subject in need of such treatment. Thus, the methods described herein for producing of a number of different O-glycosylated modified crrier proteins will prove very useful in vaccinology.
“Homogeneity” means the variability of glycan length and possibly the number of glycosylation sites. Methods listed above can be used for this purpose. SE-HPLC allows the measurement of the hydrodynamic radius. Higher numbers of glycosylation sites in the carrier lead to higher variation in hydrodynamic radius compared to a carrier with less glycosylation sites. However, when single glycan chains are analyzed, they may be more homogenous due to the more controlled length. Glycan length is measured by hydrazinolysis, SDS PAGE, and CGE. In addition, homogeneity can also mean that certain glycosylation site usage patterns change to a broader/narrower range. These factors can be measured by Glycopeptide LC-MS/MS.
“Bioconjugate homogeneity” means the homogeneity of the attached sugar residues and can be assessed using methods that measure glycan length and hydrodynamic radius.
“Yield” is measured as carbohydrate amount derived from a liter of bacterial production culture grown in a bioreactor under controlled and optimized conditions. After purification of bioconjugate, the carbohydrate yields can be directly measured by either the anthrone assay or ELISA using carbohydrate specific antisera. Indirect measurements are possible by using the protein amount (measured by BCA, Lowry, or bardford assays) and the glycan length and structure to calculate a theoretical carbohydrate amount per gram of protein. In addition, yield can also be measured by drying the glycoprotein preparation from a volatile buffer and using a balance to measure the weight.
Analytical Methods
Various methods can be used to analyze the glycans and conjugates of the invention including, for example, SDS-PAGE or capillary gel electrophoresis. O-antigen polymer length is defined by the number of repeat units that are linearly assembled. This means that the typical ladder like pattern is a consequence of different repeat unit numbers that compose the glycan. Thus, two bands next to each other in SDS PAGE (or other techniques that separate by size) differ by only a single repeat unit. These discrete differences are exploited when analyzing glycoproteins for glycan size: the unglycosylated carrier protein and the bioconjugate with different polymer chain lengths separate according to their electrophoretic mobilities. The first detectable repeat unit number (n1) and the average repeat unit number (naverage) present on a bioconjugate are measured. These parameters can be used to demonstrate batch to batch consistency or polysaccharide stability, for example.
In another embodiment, high mass MS and size exclusion HPLC could be applied to measure the size of the complete bioconjugates.
In another embodiment, an anthrone-sulfuric acid assay can be used to measure polysaccharide yields. See [85]. In another embodiment, a Methylpentose assay can be used to measure polysaccharide yields. See, e.g. [86].
Glycosylation Site Usage
Glycosylation site usage may be quantified by, for example, glycopeptide LC-MS/MS: conjugates are digested with protease(s), and the peptides are separated by a suitable chromatographic method (C18, Hydrophilic interaction HPLC HILIC, GlycoSepN columns, SE HPLC, AE HPLC), and the different peptides are identified using MS/MS. This method can be used with our without previous sugar chain shortening by chemical (smith degradation) or enzymatic methods. Quantification of glycopeptide peaks using UV detection at 215 to 280 nm allow relative determination of glycosylation site usage. In another embodiment, by size exclusion HPLC: Higher glycosylation site usage is reflected by a earlier elution time from a SE HPLC column.
Compositions
Compositions comprising a modified carrier protein are provided. In certain embodiments, the modified carrier protein is O-glycosylated. In certain embodiments, the glycan operably linked to the modified carrier protein is immunogenic and the composition is therefore an immunogenic composition.
An “immunogenic composition”, “vaccine composition,” or “pharmaceutical composition” is a preparation formulated to permit the biological activity of the active ingredient to be effective, and which contains no additional components which are unacceptably toxic to a subject to which the composition would be administered. Immunogenic, vaccine, or pharmaceutical compositions comprise pharmaceutical-grade active ingredients (e.g., pharmaceutical-grade antigen), therefore, the immunogenic, vaccine, or pharmaceutical compositions of the present invention are distinguished from any, e.g., naturally occurring composition. See [87]. In certain embodiments, the immunogenic, vaccine, or pharmaceutical composition is sterile. In certain embodiments, the composition is an immunogenic composition comprising an “immunogenic conjugate” (e.g., a modified carrier protein covalently linked to an immunogenic glycan). In certain embodiments, the immunogenic glycan is an O-antigen. Immunogenic compositions comprise an immunologically effective amount of the immunogenic glycan or immunogenic conjugate. An “immunologicaly effective amount” may be administered to an individual as a single dose or as part of a series. In certain embodiments, the immunogenic composition further comprises a pharmaceutically acceptable adjuvant, excipient, carrier, or diluent. Adjuvants, excipients, carriers, and diluents do not themselves induce an antibody or immune response, but rather they provide the technical effect of eliciting or enhancing an antibody or immune response to an antigen (e.g., an immunogenic glycan).
In an embodiment, the immunogenic compositions of the invention are monovalent formulations. In other embodiments, the immunogenic compositions of the invention are multivalent formulations, e.g. bivalent, trivalent, and tetravalent formulations. For example, a multivalent formulation comprises two or more immunogenic modified carrier proteins (e.g., a first immunogenic O-glycosylated modified carrier protein comprising a first immunogenic glycan and an at least second immunogenic O-glycosylated modified carrier protein comprising a second immunogenic glycan, optionally further comprising a third immunogenic O-glycosylated modified carrier protein comprising a third immunogenic glycan). In further embodiments, a multivalent immunogenic composition comprises an O-glycosylated modified carrier protein directly or indirectly attached to two or more distinct immunogenic glycans.
Also provided is a method of making an immunogenic composition comprising the step of mixing an immunogenic conjugate of the invention (e.g., an O-glycosylated modified carrier protein comprising an immunogenic glycan) with a pharmaceutically acceptable adjuvant, excipient, or diluent.
Provided are methods of inducing an antibody response in a mammal (e.g., a human mammal), comprising administering to the mammal an immunologically effective amount of an immunogenic composition of the present invention. Also provided are immunogenic compositions for use in inducing an antibody or immune response in a mammal. Provided are immunogenic compositions for the manufacture of a medicament for inducing an antibody or immune response in a mammal.
Streptococcus pneumoniae is a globally important encapsulated human pathogen. Streptococcus pneumoniae (S. pneumoniae, pneumococcus) is a Gram-positive bacterium responsible for considerable morbidity and mortality (particularly in infants and the elderly), causing invasive diseases such as bacteraemia and meningitis, pneumonia and other non-invasive diseases, such as acute otitis media. The major clinical syndromes caused by S. pneumoniae are widely recognized and discussed in standard medical textbooks. For instance, Invasive Pneumococcal Disease (IPD) is defined as any infection in which S. pneumoniae is isolated from the blood or another normally sterile site. Provided herein are immunogenic compositions for use in the treatment or prevention of a disease caused by Streptococcus pneumoniae infection, e.g. pneumonia, invasive pneumococcal disease (IPD), exacerbations of chronic obstructive pulmonary disease (eCOPD), otitis media, meningitis, bacteraemia, pneumonia and/or conjunctivitis. Provided are immunogenic compositions for use in inducing an immune response against a Streptococcus pneumoniae glycan in a mammal. Also provided are immunogenic compositions for inducing an antibody or immune response against a Streptococcus pneumoniae glycan in a mammal. Provided are immunogenic compositions for the manufacture of a medicament for inducing an antibody or immune response against a Streptococcus pneumoniae glycan in a mammal.
The disease caused by Streptococcus pneumoniae infection may be selected from pneumonia, invasive pneumococcal disease (IPD), exacerbations of chronic obstructive pulmonary disease (eCOPD), otitis media, meningitis, bacteraemia, pneumonia and/or conjunctivitis. Where the human mammal is an infant (defined as 0-2 years old in the context of the present invention), the disease may be selected from otitis media, meningitis, bacteraemia, pneumonia and/or conjunctivitis. In one aspect, where the human mammal is an infant (defined as 0-2 years old in the context of the present invention), the disease is selected from otitis media and/or pneumonia. Where the human mammal is elderly (i.e., 50 years or over in age, typically over 55 years and more generally over 60 years), the disease may be selected from pneumonia, invasive pneumococcal disease (IPD), and/or exacerbations of chronic obstructive pulmonary disease (eCOPD). In one aspect, where the human mammal is elderly, the disease is invasive pneumococcal disease (IPD). In another aspect, where the human mammal is elderly, the disease is exacerbations of chronic obstructive pulmonary disease (eCOPD).
Adjuvants
Adjuvants are non-antigen components used in immunogenic and vaccine compositions in order to enhance and modulate the immune or antibody response to the antigen. It is well recognized that an adjuvant enhances the induction, magnitude, and/or longevity of an antigen's immunological effect. An adjuvant is a compound that, when the compound is administered alone, does not generate an immune or antibody response to the antigen.
Immunogenic and vaccine compositions of the invention may comprise an adjuvant in addition to the antigen. In certain embodiments, the adjuvant is pharmaceutical-grade. An adjuvant may be administered before, concomitantly with, or after administration of an immunogenic or vaccine composition.
Specific examples of adjuvants include, but are not limited to, aluminum salts (alum) (such as aluminum hydroxide, aluminum phosphate, and aluminum sulfate), 3 De-O-acylated monophosphoryl lipid A (MPL), MF59, AS03, AS04, polysorbate 80 (TWEEN 80), imidazopyridine compounds (see [88]), imidazoquinoxaline compounds (see [89]), CpG ([90]) or unmethylated CpG containing oligonucleotides [91]), and saponins, such as QS21 (see [92]). In some embodiments, the adjuvant is Freund's adjuvant (complete or incomplete). Other adjuvants are oil in water emulsions (such as squalene or peanut oil), optionally in combination with immune stimulants, such as monophosphoryl lipid A (see [93]). In certain embodiments, the adjuvant is an oil-in-water emulsion (for example MF59, and AS03), liposomes (e.g., 3-o-desacyl-4′-Monophosphoryl Lipid A (MPL)) and/or saponins (e.g., QS21) (e.g., AS01), TLR2 agonist, TLR3 agonist, TLR4 agonist, TLR5 agonist, TLR6 agonist, TLR7 agonist, TLR8 agonist, TLR9 agonist, aluminium salt, nanoparticle, microparticle, ISCOMS, calcium fluoride, organic compound composite, or combinations thereof. See, e.g., [94], [95], and [96]). In a particular embodiment, the immunogenic or vaccine composition of the invention comprises an antigen and an adjuvant wherein the adjuvant is an oil-in-water emulsion (e.g., MF59, and AS03 and their respective subtypes including subtypes B and E), an aluminum salt (e.g., aluminum phosphate and aluminum hydroxide), a liposome, a saponin (e.g. QS21), an agonist of Toll-like receptors (TLRa) (e.g., TLR4a and TLR7a), or a combination thereof (e.g., Alum-TLR7a ([97]). By “TLR agonist” it is meant a component which is capable of causing a signaling response through a TLR signaling pathway, either as a direct ligand or indirectly through generation of endogenous or exogenous ligand ([98]). A TLR4 agonist, for example, is capable of causing a signalling response through a TLR-4 signalling pathway. A suitable example of a TLR-4 agonist is a lipopolysaccharide, suitably a non-toxic derivative of lipid A, particularly monophosphoryl lipid A or more particularly 3-Deacylated monophoshoryl lipid A (3D-MPL). In certain embodiments, the immunogenic or vaccine composition comprises one or more adjuvants.
In certain embodiments, the adjuvant is Monophosphoryl lipid A (such as 3-de-O-acylated monophosphoryl lipid A (3D-MPL)) or a derivative thereof, or a combination of monophosphoryl lipid A together with either an aluminium salt (e.g., aluminium phosphate or aluminium hydroxide) or an oil-in-water emulsion. In certain embodiments, the adjuvant comprises a formulation of QS21, 3D-MPL and tocopherol in an oil in water emulsion ([99]).
Excipients
Pharmaceutically acceptable excipients can be selected by those of skill in the art. For example, a pharmaceutically acceptable excipient may be a buffer, such as Tris (trimethamine), phosphate (e.g. sodium phosphate, sucrose phosphate glutamate), acetate, borate (e.g. sodium borate), citrate, glycine, histidine and succinate (e.g. sodium succinate), suitably sodium chloride, histidine, sodium phosphate or sodium succinate. A pharmaceutically acceptable excipient may include a salt, for example sodium chloride, potassium chloride or magnesium chloride. Optionally, a pharmaceutically acceptable excipient contains at least one component that stabilizes solubility and/or stability. Examples of solubilizing/stabilizing agents include detergents, for example, laurel sarcosine and/or polysorbate (e.g. TWEEN 80 (Polysorbate-80)). Examples of stabilizing agents also include poloxamer (e.g. poloxamer 124, poloxamer 188, poloxamer 237, poloxamer 338 and poloxamer 407). A phamaceutically acceptable excipient may include a non-ionic surfactant, for example polyoxyethylene sorbitan fatty acid esters, TWEEN 80 (Polysorbate-80), TWEEN 60 (Polysorbate-60), TWEEN 40 (Polysorbate-40) and TWEEN 20 (Polysorbate-20), or polyoxyethylene alkyl ethers (suitably polysorbate-80). Alternative solubilizing/stabilizing agents include arginine, and glass forming polyols (such as sucrose, trehalose and the like). A pharmaceutically excipient may be a preservative, for example phenol, 2-phenoxyethanol, or thiomersal. Other pharmaceutically acceptable excipients include sugars (e.g. lactose, sucrose), and proteins (e.g. gelatine and albumin). Pharmaceutically acceptable excipients for use with the present invention include saline solutions, aqueous dextrose and glycerol solutions (also referred to as “carriers” or “fillers” in the art). Numerous pharmaceutically acceptable excipients are described, for example, in [100].
Immunogenic compositions if the invention may also comprise diluents such as saline, and glycerol. Additionally, immunogenic compositions may comprise auxiliary substances such as wetting agents, emulsifying agents, pH buffering substances, and/or polyols.
Immunogenic compositions if the invention may also comprise one or more salts, e.g. sodium chloride, calcium chloride, sodium phosphate, monosodium glutamate, and aluminum salts (e.g. aluminum hydroxide, aluminum phosphate, alum (potassium aluminum sulfate), or a mixture of such aluminum salts).
Immunogenic compositions if the invention may also comprise a preservative, e.g. a mercury derivative thimerosal or 2-phenoxyethanol. In an embodiment, the immunogenic composition of the invention comprises 0.001% to 0.01% thimerosal. In an embodiment, the immunogenic composition of the invention comprises 0.001% to 0.01% 2-phenoxyethanol.
Immunogenic compositions if the invention may also comprise a detergent e.g. polysorbate, such as TWEEN 80 (Polysorbate 80). Detergents may be present at low levels e.g. <0.01%, but higher levels have been suggested for stabilising antigen formulations e.g. up to 10%.
Administration
Immunogenic compositions or vaccines of the invention may be used to induce an immune or antibody response and/or protect or treat a mammal susceptible to infection, by administering said immunogenic composition or vaccine composition to said mammal via systemic or mucosal route. These administrations may include injection via the intramuscular (IM), intraperitoneal, intradermal (ID) or subcutaneous routes; or via mucosal administration to the oral/alimentary, respiratory, genitourinary tracts. For example, intranasal (IN) administration may be used. Although the immunogenic composition or vaccine of the invention may be administered as a single dose, components thereof may also be co-administered together at the same time or at different times. For co-administration, the optional adjuvant, for example, may be present in any or all of the different administrations, however in one particular aspect of the invention it is present in combination with the immunogenic O-glycosylated modified carrier protein. In addition to a single route of administration, two different routes of administration may be used. Following an initial vaccination, subjects may receive one or several booster immunizations adequately spaced.
It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application.
Escherichia coli deficient in O-antigen lipopolysaccharide ligase gene waaL (E. coli W3110 ΔwaaL, ΔwecA-wzzE, ΔO16::wbgT-wbgZ cluster of P. shigelloides O17 (S. sonnei) (“E. coli W3110ΔwaaL” hereafter)) containing a chromosomal copy of a polysaccharide biosynthesis cluster (0-antigen or capsular polysaccharide) as well as two plasmids expressing PglL and a modified carrier protein was used. A single colony was inoculated in 50 ml TBdev medium [yeast extract 24 g/L, soy peptone 12 g/L, glycerol 100% 4.6 mL/L, K2HPO4 12.5 g/L, KH2PO4 2.3 g/L, MgCl2x6H2O 2.03 g/L) and grown at 30° C. to an OD of 0.8. At this point, 0.1 mM IPTG and 0.1% arabinose were added as inducers. The culture was further incubated o/n and harvested for further analysis (see [00119]). In case of bioreactor evaluation, a 50 mL (uninduced) o/n culture was used to inoculate a 11 culture in a 21 bioreactor. The bioreactor was stirred with 500-1000 rpm, pH was kept at 7.2 by auto-controlled addition of either 2 M KOH or 20% H3PO4 and the cultivation temperature was set at 30° C. The level of dissolved oxygen (pO2) was kept at 10% oxygen. In batch phase cells were grown in a TBdev medium as described above but containing glycerol at 50 g/L. As feed medium TBdev supplemented with 250 g/L glycerol and 0.1% IPTG (one-plasmid system) or 0.1% IPTG and 2.5% arabinose (two-plasmid system) was used. Induction with 0.1 mM IPTG (one-plasmid system) and 0.1 mM IPTG and 0.1% arabinose (2-plasmid system) was done at OD=35, prior to starting the fed-batch phase of growth. A linear feed rate was sustained for 24 h, followed by a 16 h starvation period. The bioreactor culture was harvested after a total of ≈40 h cultivation, when it should have reached an OD600 of 80.
The production process was analyzed by Coomassie brilliant blue staining or Western blot as described previously ([101]). After being blotted on nitrocellulose membrane, the sample was immunostained with the either anti-His, anti-glycan or anti-carrier-protein. Anti-rabbit IgG-HRP (Biorad) was used as secondary antibody. Detection was carried out with ECL™ Western Blotting Detection Reagents (Amersham Biosciences, Little Chalfont Buchinghamshire).
For periplasmic protein extraction, the cells were harvested by centrifugation for 20 min at 10,000 g and resuspended in 1 volume 0.9% NaCl. The cells were pelleted by centrifugation during 25-30 min at 7,000 g. The cells were resuspended in Suspension Buffer (25% Sucrose, 100 mM EDTA 200 mM Tris HCl pH 8.5, 250 OD/ml) and the suspension was incubated under stirring at 4-8° C. during 30 min. The suspension was centrifuged at 4-8° C. during 30 min at 7,000-10,000 g. The supernatant was discarded, the cells were resuspended in the same volume ice cold 20 mM Tris HCl pH 8.5 and incubated under stirring at 4-8° C. during 30 min. The spheroblasts were centrifuged at 4-8° C. during 25-30 min at 10,000 g, the supernatant was collected and passed through a 0.2 g membrane. Periplasmic extract was loaded on a 7.5% SDS-PAGE, and stained with Coomasie for identification.
For bioconjugate purification, the supernatant containing periplasmic proteins obtained from 100,000 OD of cells was loaded on a Source Q anionic exchange column (XK 26/40≈180 ml bed material) equilibrated with buffer A (20 mM Tris HCl pH 8.0). After washing with 5 column volumes (CV) buffer A, the proteins were eluted with a linear gradient of 15 CV to 50% buffer B (20 mM Tris HCl+1M NaCl pH 8.0) and then 2 CV to 100% buffer B. Protein were analyzed by SDS-PAGE and stained by Coomassie. Bioconjugate may elute at conductivity between 6-17 mS. The sample was concentrated 10 times and the buffer was exchanged to 20 mM Tris HCl pH 8.0.
Bioconjugate was loaded on a Source Q column (XK 16/20˜28 ml bed material) equilibrated with buffer A: 20 mM Tris HCl pH 8.0. The identical gradient that was used above was used to elute the bioconjugate. Protein were analyzed by SDS-PAGE and stained by Coomassie. Normally the bioconjugate elutes at conductivity between 6-17 mS. The sample was concentrated 10 times and the buffer was exchanged to 20 mM Tris HCl pH 8.0.
Bioconjugate was loaded on Superdex 200 (Hi Load 26/60, prep grade) that was equilibrated with 20 mM Tris HCl pH 8.0. Protein fractions from Superdex 200 column were analyzed by SDS-PAGE and stained by Coomassie stained.
Bioconjugates from different purification steps were analyzed by SDS-PAGE and stained by Coomassie. Bioconjugate is purified to more than 98% purity using the process. Bioconjugate can be successfully produced using this technology.
Pseudomonas exotoxin A (EPA) carrier protein (SEQ ID NO: 1) was modified to incorporate one or more GlycoTags from Neisseria meningitidis pilin PilE (wild type sequence provided as SEQ ID NO: 137) (for methods see [29]; [6]; [4]; and [31], all incorporated herein by reference in their entireties). Recombinant EPA (rEPA, SEQ ID NO: 1) was modified to make three other recombinant EPA proteins:
Neisseria meningitidis PglL (NmPglL) (polynucleotide sequence SEQ ID NO 8, encoding amino acid sequence SEQ ID NO: 9), Shigella sonnei O-antigen gene cluster (polynucleotide sequence SEQ ID NO: 6, encoding amino acid sequences SEQ ID NO: 208-216), and one of carrier proteins rEPA1, rEPA2, and rEPA3 (operatively linked to a DsbA periplasmic signal sequence (SEQ ID NO: 5, encoding SEQ ID NO: 4)) were introduced into Escherichia coli W3110 deficient in O-antigen lipopolysaccharide ligase gene waaL (E. coli W3110ΔwaaL). Three cell lots were made, one for each of rEPA1-rEPA3. Coomassie blue staining and Western blot assays confirmed that NmPglL efficiently transferred lipid-carrier-linked S. sonnei O-antigen to each of rEPA1, rEPA2, and rEPA3 (corresponding to #1-#4, respectively, in
The stability of the rEPA1-S. sonnei O-antigen bioconjugate was studied at three different temperatures (−80° C., 2-8° C., and room temperature (RT) 20-25° C.) for a time of six months. Additionally, five freeze/thaw cycles (5 FT) on purified rEPA1-S. sonnei O-antigen were performed. SEC-HPLC readouts of samples taken at zero months, two weeks, one month, three months, and six months revealed that the rEPA1-S. sonnei O-antigen bioconjugate peak area was constant over time
To evaluate the immunogenicity of the rEPA1-S. sonnei O-antigen bioconjugate, four female New Zealand White Rabbits (age 3-4 months) were divided into two groups (two rabbits per group) and subcutaneously injected at zero, seven, ten, and eighteen days with a bioconjugate composition comprising 2 μg of sugar, 40 μg of protein, and non-Freund's adjuvant (Group 1) or 10 μg of sugar, 200 μg of protein, and non-Freund's adjuvant (Group 2). Bleeds occurred at zero, twenty-one, and twenty-eight days. Western blot of the blood samples taken at twenty-eight days revealed that antibodies against S. sonnei O-antigen and EPA were generated in all subjects (
To evaluate the technical feasibility of using multiple GlycoTags on the carrier protein, EPA was modified to incorporate either one or two copies of the NmPilE GlycoTag having the sequence SEQ ID NO: 9. For EPA incorporating only one copy of the GlycoTag, it was located at the N-terminus (rEPA1). For EPA incorporating two copies of the GlycoTag SEQ ID NO: 140, the first GlycoTag was located at the N-terminus and the second was located at the C-terminus (rEPA43, SEQ ID NO: 135). Neisseria meningitidis PglL (NmPglL) was applied to rEPA1 or rEPA43 in the presence of one of three distinct lipid-carrier-linked polysaccharides: S. sonnei O-antigen, S. flexneri 2a CPS, or Streptococcus pneumoniae 12F CPS. NmPglL transferred each of S. sonnei O-antigen, S. flexneri 2a CPS, and Streptococcus pneumoniae 12F CPS onto rEPA1 and rEPA43 (
These results show that a carrier protein modified to incorporate more than one GlycoTag may be used for in vivo bioconjugation.
To evaluate the versatility of NmPglL toward carrier protein, known carrier proteins AcrA, PcrV, and Crm197 were also modified as above to incorporate one copy of the NmPilE GlycoTag having the sequence SEQ ID NO: 140. For modified AcrA (mAcrA), a pelB signal sequence (residues 1-22 of SEQ ID NO: 198) was operably linked to the N-terminus of the AcrA sequence, the GlycoTag SEQ ID NO: 140 was operably linked to the C-terminus of AcrA, and a 6×His-tag was operably linked to the C-terminus of the GlycoTag (SEQ ID NO: 199 for mAcrA). For modified PcrV (mPcrV), a LtIIb signal sequence (residues 1-23 of SEQ ID NIO: 202) was operably linked to the N-terminus of the PcrV sequence, the GlycoTag SEQ ID NO: 140 was operably linked to the C-terminus of PcrV, and a 6×His-tag was operably linked to the C-terminus of the GlycoTag (SEQ ID NO: 202 for mPcrV). For a first modified Crm197 (mCrm197), a DsbA signal sequence (SEQ ID NO: 4) was operably linked to the N-terminus of the Crm197 sequence, the GlycoTag SEQ ID NO: 140 was operably linked to the C-terminus of Crm197, and a 6×His-tag was operably linked to the C-terminus of the GlycoTag (SEQ ID NO: 204 for mCrm197). For a second modified Crm197 (m2Crm197, SEQ ID NO: 207), a DsbA signal sequence (SEQ ID NO: 4) was operably linked to the N-terminus of the GlycoTag sequence SEQ ID NO: 140, which were together operably linked to the N-terminus of the Crm197 sequence; the GlycoTag SEQ ID NO: 140 was also operably linked to the C-terminus of Crm197, and a 6×His-tag was operably linked to the C-terminus of the GlycoTag (see m2Crm197 sequence SEQ ID NO: 207). NmPglL, S. sonnei O-antigen, and one of mAcrA, mPcrV, mCrm197, and m2Crm197 were operatively introduced into E. coli W3110ΔwaaL. In this way, NmPglL contacted lipid-carrier-linked S. sonnei O-antigen in the presence of mAcrA, mPcrV, mCrm197, or m2Crm197. NmPglL transferred S. sonnei O-antigen onto mAcrA, mPcrV, mCrm197, and m2Crm197 (
To evaluate the substrate versatility of NmPglL, a polysaccharide gene cluster (i.e., nucleotide sequence) encoding a Pneumococcal capsular polysaccharide (CPSs) from one of each of serotypes Sp8, Sp12F, Sp14, Sp22A, Sp23A, and Sp33F was chromosomally introduced (Table 1) into E. coli W3110ΔwaaL. NmPglL, and rEPA1 or rEPA43 nucleotide sequences (Example 1 above) were also operatively introduced into each of the E. coli W3110ΔwaaL cells. Twelve recombinant host cells were made, six incorporating one of each of the six different Pneumococcal CPSes and rEPA1, and another six incorporating one of each of the six different Pneumococcal CPSes and rEPA43. In this way, NmPglL contacted each lipid-carrier-linked Pneumococcal CPS peptidoglycan in the presence of rEPA1 or rEPA43, and NmPglL transferred Pneumococcal CPS glycan onto rEPA1 or rEPA43 in vivo:
The results with respect to Pneumococcal Sp15A CPS were inconclusive because no transfer of Sp15A CPS was detected, but transfer of Pneumococcal Sp14 CPS was detected and both Sp15A and Sp14 CPSes have the reducing end structure Galactose-β1,4-Glucose-UndPP. NmPglL transferred onto rEPA1 and rEPA43 all of the Pneumococcal serotype 8, 12F, 14, 22A, 23A, and 33F glycans (having reducing end structures Glucuronic acid-β1,4-glucose (Sp8), N-acetyl-fucosamine-α1,3-N-acetyl-galactosamine (Sp12F), Galactose-β1,4-glucose (Sp14), Rhamnose-β1,4-glucose (Sp22A, Sp23A), and Galactofuranose-β1,3-glucose (Sp33F), respectively). These results confirm that NmPglL glycan substrates include those having glucose or GalNAc at its reducing end (also supported by Faridmoayer et al. ([3]).
Twenty Neisseria PglL proteins were identified, each from different Neisseria species. Using established methods, each PglL was first screened for its ability to transfer the S. sonnei O-antigen (made by the operon consisting of the wbgT, wbgU, wzx, wxy, wbgV, wbgW, wbgX, wbgY, and wbgZ genes, encoding proteins of SEQ ID NOs: 208-216, which make a saccharide with a reducing end structure N-acetyl-altruronic acid-α1,3-4-amino-N-acetyl-fucosamine)7 onto an endogenous pilin and with an efficiency that was at least comparable to (i.e., equal to or greater than) that of NmPglL (control). Six Neisseria meningitidis PglL homologues were thereby identified. The six Neisseria PglL proteins were then each screened for its ability to transfer the S. sonnei O-antigen onto rEPA1 and with an efficiency that was at least comparable to NmPglL (control). Four Neisseria meningitidis PglL homologues were thereby identified. For methods see [4], [6], [29], and [31], all incorporated herein by reference in their entireties. The results were as summarized in Table 2 below:
Neisseria meningitidis(control)
Neisseria gonorrhoeae
Neisseria lactamica 020-06
Neisseria lactamica ATCC 23970
Neisseria gonorrhoeae F62
Neisseria cinerea ATCC 14685
Neisseria cinerea ATCC 14685
Neisseria mucosa
Neisseria mucosa
Neisseria flavescens NRL30031/H210
Neisseria mucosa ATCC 25996
Neisseria mucosa ATCC 25996
Neisseria sp. oral taxon 014
Neisseria sp. oral taxon 014
Neisseria arctica
Neisseria shayeganii 871
Neisseria shayeganii 871
Neisseria shayeganii 871
Neisseria sp. 83E34
Neisseria sp. 83E34
Neisseria wadsworthii
Neisseria wadsworthii
Neisseria elongata subsp. glycolytica
Neisseria elongata subsp. glycolytica
Neisseria bacilliformis ATCC
Neisseria bacilliformis ATCC
Neisseria sp. oral taxon 020 str. F0370
Neisseria sp. oral taxon 020 str. F0370
Neisseria sp. 74A18 PglL
Neisseria sp. 74A18 PglL
Neisseria weaver ATCC 51223
Neisseria macacae ATCC 33926
Neisseria macacae ATCC 33926
NmPglL, Neisseria gonorrhoeae PglL (NgPglL), Neisseria lactamica 020-06 (NlPglL), Neisseria elongata subsp. glycolytica ATCC 29315 (NePglL), and Neisseria bacilliformis ATCC BAA-1200 (NbPglL) were shown to transfer the lipid-carrier-linked S. sonnei O-antigen onto the soluble NmPilE-based GlycoTag. For most, this was a glycan transfer onto a non-endogenous GlycoTag. These results indicate that NgPglL, NlPglL, Neisseria mucosa ATCC 25996 (NmuPglL), Neisseria shayeganii 871 (SEQ ID NO: 33) (NsPglL), NePglL, and NbPglL all transfer a lipid linked glycan substrate with a reducing end structure N-acetyl-fucosamine (FucNAc) (S-2 to S-1 structure being N-acetyl-altruronic acid-α1,3-4-amino-N-acetyl-fucosamine) onto its endogenous pilin or rEPA1 with an efficiency that is at least comparable to control (NmPglL). See also Example 8 below.
Designed Carrier Proteins with Internal NmPilE GlycoTag
Twenty-two modified EPA carrier proteins were designed and produced, each incorporating one copy, at an internal residue, of the NmPilE GlycoTag having the sequence SEQ ID NO: 140 (29 amino acid sequence corresponding to residues 45-73 of NmPilE sequence SEQ ID NO: 137). The below-listed EPA residues (numbered with respect to SEQ ID NO: 1) were substituted for the GlycoTag sequence SEQ ID NO: 140 (i.e., an insertion of 29 amino acids):
To evaluate glycan transfer to a GlycoTag located within a carrier protein (i.e., an “Internal GlycoTag”), nucleotide sequences encoding NmPglL (SEQ ID NO: 9), S. sonnei O-antigen (SEQ ID NOs: 208-216), and one of each of rEPA4 to rEPA25 (operatively linked to a DsbA periplasmic signal sequence) were introduced into E. coli W3110ΔwaaL (full genotype E. coli W3110 ΔwaaL::pglLNm, ΔwecAwzzECA, ΔO16::wbgT-wbgZ cluster of P. shigelloides O17 Twenty-two different cell lots were made, one for each of rEPA4-rEPA25. Western blot assays confirmed that NmPglL efficiently transferred lipid-carrier-linked S. sonnei O-antigen to all of rEPA4-rEPA17, rEPA19-rEPA25 in vivo. In this experiment, results with respect to rEPA15 were inconclusive because rEPA18 expression was not observed.
Homologues of Neisseria meningitidis pilin PilE were identified from Neisseria gonorrhoeae (NgPilin), Neisseria lactamica 020-06 (NiPilin), Neisseria elongate subsp. glycolytica ATCC 29315 (NePilin), and Neisseria bacilliformis ATCC BAA-1200 (NbPilin), Neisseria mucosa ATCC 25996 (NmuPilin), and Neisseria shayeganii 871 (NsPilin) (amino acid sequences SEQ ID NOs 143, 148, 153, 156, 159, and 162, respectively). See also the endogenous pilins in Example 3 and Table 2 above. GlycoTags from each of those pilin were designed.
Using established methods, EPA carrier protein (SEQ ID NO: 1) was modified to incorporate one copy of a GlycoTag from one of each of NgPilin, NlPilin, NePilin, NbPilin, NmuPilin, and NsPilin. Six recombinant EPA (rEPA) proteins were made:
Western blot assays was used to determine whether NgPglL (SEQ ID NO: 11), NlPglL (SEQ ID NO: 13), NePglL (SEQ ID NO: 39), NbPglL (SEQ ID NO: 41), NmuPglL (SEQ ID NO: 25) (
These results show that PglLs (NgPglL, NlPglL, and NsPglL) transfer a lipid-carrier-linked peptidoglycan having reducing end structure N-acetyl-fucosamine (FucNAc) (S-2 to S-1 structure being N-acetyl-altruronic acid-α1,3-4-amino-N-acetyl-fucosamine) onto a modified EPA carrier protein that has at its N-terminus, an endogenous GlycoTag. Of these three, NgPglL transferred S. sonnei O-antigen to rEPA26 more efficiently than NlPglL transferred S. sonnei O-antigen to rEPA27. Also, NlPglL transferred S. sonnei O-antigen to rEPA27 more efficiently than NsPglL transferred S. sonnei O-antigen to rEPA31. NmPglL also transferred S. sonnei O-antigen onto rEPA26, rEPA27 and rEPA31
Designed Carrier Proteins Comprising Neisseria gonnorrhoeae GlycoTag(s)
Modified EPA carrier proteins were designed and produced, each incorporating one or two copies of a Neisseria gonorrhoeae Pilin GlycoTag sequence. Internal EPA residues R274, S408, and/or A519 (numbered with respect to SEQ ID NO: 1) were substituted for the NgPilin GlycoTag having the sequence SEQ ID NO: 145 or SEQ ID NO: 146 (30 amino acid sequence corresponding to residues 52-81 of NgPilin sequence SEQ ID NO: 143 and 20 amino acid sequence corresponding to residues 62-81 of NgPilin sequence SEQ ID NO: 143, respectively) (Table 4 below).
Using established methods, nucleotide sequences encoding Neisseria gonorrhoeae PglL (NgPglL) (SEQ ID NO: 11), Shigella sonnei O-antigen (SEQ ID NOs: 208-216), and one of each of rEPA32-rEPA39 (SEQ ID NOs: 113, 115 117, 119, 121, 123, 125, and 127, respectively, under DsbA periplasmic signal sequence) were operatively introduced into each of two E. coli W3110ΔwaaL host cell strains. Strain “st12807” has the NgPglL sequence integrated at the waaL locus and has genotype: W3110 ΔwaaL, ΔwecAwzzECA ΔO16::wbgT-wbgZ cluster of P. shigelloides O17, ΔwaaL::pglL_Neisseria_gonorrhoeae_CNT56492. Strain “st8774” does not have the NgPglL sequence integrated at the waaL locus and has genotype: W3110 ΔwaaL ΔwecAwzzECA ΔO16::wbgT-wbgZ cluster of P. shigelloides O17. Sixteen cell lots were made, one for each of rEPA32-rEPA39 in each of strains st12807 and st8774. For methods see [4], [6], [29], and [31], all incorporated herein by reference in their entireties. Western blot assays show that NgPglL efficiently transferred lipid-carrier-linked S. sonnei O-antigen to most of the modified EPAs in vivo (rEPA32, rEPA34, rEPA36-rEPA38), but inefficiently for two (rEPA33 and rEPA39), and not at all for one (rEPA35).
These results show that a modified EPA carrier protein having internal residues R274, or A519 substituted with either GlycoTag sequence SEQ ID NOs: 145 and 146 is efficient for in vivo O-glycosylation of the modified EPA via NgPglL. Also, a modified EPA carrier protein having both internal residues R274 and A519 substituted with GlycoTag sequence SEQ ID NO 146 is efficient for in vivo O-glycosylation of the modified EPA via NgPglL.
These results also show that a modified EPA carrier protein having internal residue S408 substituted with GlycoTag sequence SEQ ID NO: 145 works, but inefficiently, for in vivo O-glycosylation of the modified EPA via NgPglL This is interesting because, in other studies, a modified EPA carrier protein incorporating a NmGlycoTag at residue S408 was efficiently O-glycosylated by NmPglL (unpublished data).
A modified EPA carrier protein having internal residue S408 substituted with GlycoTag sequence SEQ ID NO: 146 did not work for in vivo O-glycosylation of the modified EPA via NgPglL.
The abilities of NmPglL and NgPglL to transfer glycan to modified EPA carrier proteins comprising a NgPilin GlycoTag were compared. rEPA32, rEPA34, rEPA36, and rEPA38 from Example 6 were used as well as:
Using established methods, a nucleotide sequence encoding NmPglL (SEQ ID NO: 9) or NgPglL (SEQ ID NO: 11), a nucleotide sequence encoding enzymes required to make Shigella sonnei O-antigen (SEQ ID NOs: 208-216), and a nucleotide sequence encoding one of each of rEPA32, rEPA34, rEPA36, rEPA38, rEPA40, rEPA41, and rEPA42 (under DsbA periplasmic signal sequence) were operatively introduced into E. coli W3110ΔwaaL. Fourteen different cell lots were made. Coomassie blue staining and Western blot assays show that NmPglL efficiently transferred lipid-carrier-linked S. sonnei O-antigen to all of rEPA32, rEPA34, rEPA36, rEPA38, rEPA40, rEPA41, and rEPA42 in vivo. Likewise, NgPglL efficiently transferred lipid-carrier-linked S. sonnei O-antigen to all of rEPA32, rEPA34, rEPA36, rEPA38, rEPA40, rEPA41, and rEPA42 in vivo.
These results show that NgPilin GlycoTag sequence SEQ ID NO: 145 and NgPilin GlycoTag sequence SEQ ID NO: 146 are efficiently O-glycosylated by both NmPglL and NgPglL when the GlycoTag is introduced at the N-terminus or into an internal residue of a carrier protein, here EPA. This was true when using either one copy or two copies of NgPilin GlycoTag sequences SEQ ID NO: 145 and SEQ ID NO: 146. NgPglL glycosylated GlycoTag sequence SEQ ID NO: 146 more efficiently than did NmPglL.
NmPglL and NmPglL Homologues Transfer Pneumococcal Capsular Polysaccharides (CPS) to rEPA1
NmPglL and the twenty homologues thereof described in Example 3 were assessed for their ability to transfer Streptococcus pneumoniae serotype Sp8 or Sp22A CPS glycans onto rEPA1 (under DsbA periplasmic signal sequence) in vivo. Pneumococcal Sp8 CPS has a reducing end structure of Glucuronic acid-β1,4-glucose (Table 1). Pneumococcal Sp22A CPS has a reducing end structure of Rhamnose-β1,4-glucose (Table 1).
Using established methods, a nucleotide sequence encoding a CPS from Pneumococcal serotype Sp8 or Sp22A, as well as a nucleotide sequence encoding one of the twenty-one Neisserial PglL proteins, and a nucleotide sequence encoding rEPA1 were operatively introduced into E. coli W3110ΔwaaL. Forty-two host cells were made (each CPS being assayed with each of the twenty-one PglLs). In this way, Neisserial PglL contacted each lipid-carrier-linked Pneumococcal CPS peptidoglycan in the presence of rEPA1 and the Neisserial PglL transferred Pneumococcal CPS glycan onto rEPA1 in vivo (Table 5 and
Coomassie blue staining and Western blot assays confirmed that NmPglL, Neisseria gonorrhoeae PglL (NgPglL) (SEQ ID NO: 11), Neisseria lactamica 020-06 (NlPglL) (SEQ ID NO: 13), Neisseria lactamica ATCC 23970 PglL (NlATCC23970PglL) (SEQ ID NO: 15), and Neisseria gonorrhoeae F62 PglL (NgF62PglL) (SEQ ID NO: 17) transfer lipid-carrier-linked Pneomococcal Sp. 8 CPS glycan onto rEPA1.
Coomassie blue staining and Western blot assays confirmed that NmPglL, NgPglL, NlPglL, and NgF62PglL transfer lipid-carrier-linked Pneomococcal Sp. 22A CPS glycan onto rEPA1.
Neisseria meningitidis PglL
Neisseria gonorrhoeae PglL
Neisseria lactamica 020-06 PglL
Neisseria lactamica ATCC 23970 PglL
Neisseria gonorrhoeae F62 PglL
Neisseria cinerea ATCC 14685 PglL
Neisseria mucosa PglL
Neisseria flavescens NRL30031/H210
Neisseria mucosa ATCC 25996 PglL
Neisseria sp. oral taxon 014 strain
Neisseria arctica PglL
Neisseria shayeganii 871 PglL
Neisseria shayeganii 871 PglL
Neisseria sp. 83E34 PglL
Neisseria wadsworthii PglL
Neisseria elongata subsp. glycolytica
Neisseria bacilliformis ATCC
Neisseria sp. oral taxon 020 str.
Neisseria sp. 74A18 PglL
Neisseria weaver ATCC 51223 PglL
Neisseria macacae ATCC 33926 PglL
Comparison of Pilin structures
Neisseria meningitidis
Neisseria gonorrhoeae
Neisseria lactamica 020-06
Neisseria shayeganii 871
Neisseria meningitidis
Neisseria gonorrhoeae
Neisseria meningitidis
Neisseria gonorrhoeae
Neisseria lactamica 020-06
Neisseria shayeganii 871
Pseudomonas exotoxin A (EPA) amino acid sequence (mature sequence/signal sequence removed). Corresponds to NCBI Reference Sequence WP_016851883.1.
Pseudomonas exotoxin A (EPA) amino acid sequence
MHLIPHWIPLVASLGLLAGGSFASAAEEAFDLWNECAKACVLDLKDGVRS
Pseudomonas exotoxin A (EPA) polynucleotide sequence. Corresponds to NCBI Accession JX026663.1
DsbA signal sequence.
DsbA signal peptide polynucleotide sequence.
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen cluster nucleotide sequence (comprising wbgT, wbgU, wzx, wxy, wbgV, wbgW, wbgX, wbgY, and wbgZ coding regions; 10963 bps). Corresponds to NCBI Genbank Accession AF285970.1. [102]
PelB signal sequence.
Neisseria meningitidis PglL (NmPglL) nucleotide sequence.
Neisseria meningitidis PglL (NmPglL) amino acid sequence. Corresponds to NCBI GenBank Accession AEK98518.1.
Neisseria gonnorrhoeae PglL (NgPglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession CNT56492.1.
Neisseria gonnorrhoeae PglL (NgPglL) amino acid sequence. Corresponds to NCBI GenBank Accession CNT56492.1.
Neisseria lactamica 020-06 PglL (NlPglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession CBN87842.1.
Neisseria lactamica 020-06 PglL (NlPglL) amino acid sequence. Corresponds to NCBI GenBank Accession CBN87842.1.
Neisseria lactamica ATCC 23970 PglL (NlATCC23970PglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EEZ75009.1.
Neisseria lactamica ATCC 23970 PglL (NlATCC23970PglL) amino acid sequence. Corresponds to NCBI GenBank Accession EEZ75009.1.
Neisseria gonorrhoeae F62 PglL (NgF62PglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EFF40644.1.
Neisseria gonorrhoeae F62 PglL (NgF62PglL) amino acid sequence. Corresponds to NCBI GenBank Accession EFF40644.1.
Neisseria cinerea ATCC 14685 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EEZ72274.1.
Neisseria cinerea ATCC 14685 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EEZ72274.1.
Neisseria mucosa PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession KGJ31457.1.
Neisseria mucosa PglL amino acid sequence. Corresponds to NCBI GenBank Accession KGJ31457.1.
Neisseria flavescens NRL30031/H210 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EEG34481.1.
Neisseria flavescens NRL30031/H210 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EEG34481.1.
Neisseria mucosa ATCC 25996 PglL (NmuPglL) polynucleotide seqeuence. Corresponds to NCBI GenBank Accession EFC87884.1.
Neisseria mucosa ATCC 25996 PglL (NmuPglL) amino acid seqeuence. Corresponds to NCBI GenBank Accession EFC87884.1.
Neisseria sp. oral taxon 014 strain F0314 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EFI23064.1.
Neisseria sp. oral taxon 014 strain F0314 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EFI23064.1.
Neisseria arctica PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession KLT72636.1.
Neisseria arctica PglL amino acid sequence. Corresponds to NCBI GenBank Accession KLT72636.1.
Neisseria shayeganii 871 PglL (Ns2PglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EGY51766.1.
Neisseria shayeganii 871 PglL (Ns2PglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EGY51766.1.
Neisseria shayeganii 871 PglL (NsPglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EGY51593.1.
Neisseria shayeganii 871 PglL (NsPglL) amino acid sequence. Corresponds to NCBI GenBank Accession EGY51593.1.
Neisseria sp. 83E34 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession KPN72282.1.
Neisseria sp. 83E34 PglL amino acid sequence. Corresponds to NCBI GenBank Accession KPN72282.1.
Neisseria wadsworthii PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EGZ44098.1.
Neisseria wadsworthii PglL amino acid sequence. Corresponds to NCBI GenBank Accession EGZ44098.1.
Neisseria elongata subsp. glycolytica ATCC 29315 PglL (NePglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EFE49313.1.
Neisseria elongata subsp. glycolytica ATCC 29315 PglL (NePglL) amino acid sequence. Corresponds to NCBI GenBank Accession EFE49313.1.
Neisseria bacilliformis ATCC BAA-1200 PglL (NbPglL) polynucleotide sequence. Corresponds to NCBI GenBank Accession EGF10835.1.
Neisseria bacilliformis ATCC BAA-1200 PglL (NbPglL) amino acid sequence. Corresponds to NCBI GenBank Accession EGF10835.1.
Neisseria sp. oral taxon 020 str. F0370 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EKY03535.1.
Neisseria sp. oral taxon 020 str. F0370 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EKY03535.1.
Neisseria sp. 74A18 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession KPN74230.1.
Neisseria sp. 74A18 PglL amino acid sequence. Corresponds to NCBI GenBank Accession KPN74230.1.
Neisseria weaveri ATCC 51223 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EGV35010.1.
Neisseria weaveri ATCC 51223 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EGV35010.1.
Neisseria macacae ATCC 33926 PglL polynucleotide sequence. Corresponds to NCBI GenBank Accession EGQ77792.1
Neisseria macacae ATCC 33926 PglL amino acid sequence. Corresponds to NCBI GenBank Accession EGQ77792.1
rEPA1 polynucleotide sequence.
MKKIWLALAGLVLAFSASA
SAVTEYYLNHGEWPGNNTSAGVATSSEIKAE
rEPA2 polynucleotide sequence.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
YLNHGEWPGNNTSAGVATSSEIKGECAGPADSGDALLERNYPTGAEFLGD
rEPA3 polynucleotide sequence.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
AGVATSSEIK
HHHHHH
rEPA4 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A14.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKSAVTEYYLNHGEWPGNN
TSAGVATSSEIKCVLDLKDGVRSSRMSVDPAIADTNGQGVLHYSMVLEG
rEPA5 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue D36.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
rEPA6 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue Q92.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
rEPA7 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue G123.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVDP
WPGNNTSAGVATSSEIKNQLSHMSPIYTIEMGDELLAKLARDATFFVRAHE
HH
rEPA8_E157_nucleotide
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVDP
HH
rEPA9 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A177.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
YYLNHGEWPGNNTSAGVATSSEIKQAQPRREKRWSEWASGKVLCLLDPLD
rEPA10 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue Y208.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
AGVATSSEIKLAQQRCNLDDTWEGKIYRVLAGNPAKHDLDIKPTVISHR
rEPA11 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue N231.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
rEPA12 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue E252.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
rEPA13 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue 8274.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
NHGEWPGNNTSAGVATSSEIKHRQPRGWEQLEQCGYPVQRLVALYLAARL
rEPA14 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A301.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
rEPA15 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue Q307.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
SAGVATSSEIKVDQVIRNALASPGSGGDLGEAIREQPEQARLALTLAAA
rEPA16 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A365.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
HGEWPGNNTSAGVATSSEIKDVVSLTCPVAAGECAGPADSGDALLERNY
rEPA17 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue S408.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
SSEIKFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSIVFG
rEPA18 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue T418.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMS
rEPA19 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A464.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
rEPA20 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
LNHGEWPGNNTSAGVATSSEIKAPEAAGEVERLI
rEPA21 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue G525.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
AVTEYYLNHGEWPGNNTSAGVATSSEIKEVERLI
rEPA22 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue H533.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
IKPLPLRLDAITGPEEEGGRVTILGWPLAERTVV
rEPA23 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue S585.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
HGEWPGNNTSAGVATSSEIKIPDKEQAISALPDY
rEPA24_polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue K240.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
GNNTSAGVATSSEIKPTVISHRLHFPEGGSLAAL
rEPA25 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 in at residue A375.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
GNNTSAGVATSSEIKGECAGPADSGDALLERNYP
rEPA26 polynucleotide sequence—GlycotTag sequence SEQ ID NO: 145 at N-terminus.
MKKIWLALAGLVLAFSASASSAVTGYYLNHGTWP
KDNTSAGVASSPTDIKAEEAFDLWNECAKACVLD
rEPA27 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 150 at N-terminus.
MKKIWLALAGLVLAFSASASAAVVEYYSDNGTFP
AQNASAGIATASAITGKYVAKAEEAFDLWNECAK
rEPA28 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 154 at N-terminus.
MKKIWLALAGLVLAFSASASSALSEAFQTDGITG
MTAAAKAFNKTAAAGGGAGGAAAAGTQHASKAEE
HHH
rEPA29 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 157 at N-terminus.
MKKIWLALAGLVLAFSASASTLISTDATSINDLD
IAVAAWNRQANNTGANSKYVTSVAEEAFDLWNEC
rEPA30 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 160 at N-terminus.
MKKIWLALAGLVLAFSASASTPLVEAVAASSNAI
ACKNNAPWYTSSVQSGKYVSAIEPAVKAEEAFDL
rEPA31 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 163 at N-terminus.
MKKIWLALAGLVLAFSASASGAVTEYEADKGVFP
TSNASAGVAAAADINGKAEEAFDLWNECAKACVL
rEPA32 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 145 in at residue 8274.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
KDNTSAGVASSPTDIKHRQPRGWEQLEQCGYPVQ
rEPA33 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 145 in at residue S408.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKAC
NTSAGVASSPTDIKFSTRGTQNWTVERLLQAHRQ
rEPA34 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 145 in at residue A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
VTGYYLNHGTWPKDNTSAGVASSPTDIKAPEAAGEVERLIGHPLPLRLD
rEPA35 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residue S408.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
PTDIKFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSIVF
rEPA36 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residue A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
WPKDNTSAGVASSPTDIKAPEAAGEVERLIGHPLPLRLDAITGPEEEGG
rEPA37 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residues 8274 and S408.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
WPKDNTSAGVASSPTDIKHRQPRGWEQLEQCGYPVQRLVALYLAARLSW
rEPA38 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residues 8274 and A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
SAGVASSPTDIKHRQPRGWEQLEQCGYPVQRLVALYLAARLSWNQVDQVI
rEPA39 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residues S408 and A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSV
PTDIKFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSIVF
rEPA40 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 145 at N-terminus.
MKKIWLALAGLVLAFSASASSAVTGYYLNHGTWPKDNSAGVASSPTDIK
rEPA41 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 146 in at residue 8274.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVD
SAGVASSPTDIKHRQPRGWEQLEQCGYPVQRLVALYLAARLSWNQVDQVI
rEPA42 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 145 in at residues 8274 and A519.
MKKIWLALAGLVLAFSASAAEEAFDLWNECAKACVLDL
SAGVASSPTDIKHRQPRGWEQLEQCGYPVQ
PKDNTSAGVASSPTDIKAPEAAGEVERLIGHPL
rEPA43 polynucleotide sequence—GlycoTag sequence SEQ ID NO: 140 at N-terminus and C-terminus.
MKKIWLALAGLVLAFSASA
SAVTEYYLNHGEWPGNNT
SAGVATSSEIKAEEAFDLWNECAKACVLDLK
GVATSSEIK
HHHHHH
SAVTEYYLNHGEWPGNNTSAGVATSSEIKAEEAFDLW
TEYYLNHGEWPGNNTSAGVATSSEIK
HHHHHH
Neisseria meningitidis MC58 PilE amino acid sequence (mature sequence; signal sequence removed). Corresponds to NCBI Accession NP_273084.1.
Neisseria meningitidis MC58 PilE amino acid
MNTLQKGFTLIELMIVIAIVGILAAVALPAYQDYTARA
Neisseria meningitidis MC58 PilE polynucleotide
Neisseria meningitidis PilE GlycoTag amino acid sequence (corresponding to residues 45-73 of SEQ ID NO: 137; 29 amino acid long). E.g. Ser Ala Val Thr Glu Tyr Tyr Leu Asn His Gly Glu Trp Pro Gly Asn Asn Thr Ser Ala Gly Val Ala Thr Ser Ser Glu Ile Lys
Neisseria meningitidis PilE GlycoTag amino acid sequence (corresponding to residues 55-73 of SEQ ID NO: 137; 19 amino acid long). E.g. Gly Glu Trp Pro Gly Asn Asn Thr Ser Ala Gly Val Ala Thr Ser Ser Glu Ile Lys
Neisseria meningitidis PilE GlycoTag amino acid sequence (corresponding to residues 55-66 of SEQ ID NO: 137; 12 amino acid long). E.g. Gly Glu Trp Pro Gly Asn Asn Thr Ser Ala Gly Val
Neisseria gonorrhoeae Pilin (NgPilin) amino acid sequence. Corresponds to NCBI GenBank CNT62005.1.
Neisseria gonorrhoeae Pilin (NgPilin) polynucleotide sequence. Corresponds to NCBI GenBank CNT62005.1.
Neisseria gonorrhoeae GlycoTag amino acid sequence (corresponding to residues 52-81 of SEQ ID NO: 143; 30 amino acid long). E. g. Ser Ala Val Thr Gly Tyr Tyr Leu Asn His Gly Thr Trp Pro Lys Asp Asn Thr Ser Ala Gly Val Ala Ser Ser Pro Thr Asp Ile Lys
Neisseria gonorrhoeae GlycoTag amino acid sequence (corresponding to residues 62-81 of SEQ ID NO: 143; 20 amino acid long). E. g. Gly Thr Trp Pro Lys Asp Asn Thr Ser Ala Gly Val Ala Ser Ser Pro Thr Asp Ile Lys
Neisseria gonorrhoeae GlycoTag amino acid sequence (corresponding to residues 62-73 of SEQ ID NO: 143; 12 amino acid long). E.g. Gly Thr Trp Pro Lys Asp Asn Thr Ser Ala Gly Val
Neisseria lactamica 020-06 Pilin (NlPilin) amino acid sequence. Corresponds to NCBI GenBank CBN86420.1.
Neisseria lactamica 020-06 Pilin (NlPilin) polynucleotide sequence. Corresponds to NCBI GenBank CBN86420.1.
Neisseria lactamica 020-06 GlycoTag amino acid sequence (corresponding to residues 52-86 of SEQ ID NO: 148; 35 amino acid long). E.g. Ala Ala Val Val Glu Tyr Tyr Ser Asp Asn Gly Thr Phe Pro Ala Gln Asn Ala Ser Ala Gly Ile Ala Thr Ala Ser Ala Ile Thr Gly Lys Tyr Val Ala Lys
Neisseria lactamica 020-06 GlycoTag amino acid sequence (corresponding to residues 62-73 of SEQ ID NO: 148; 12 amino acid long). E.g. Gly Thr Phe Pro Ala Gln Asn Ala Ser Ala Gly Ile
Neisseria elongata subsp. glycolytica ATCC 29315 Pilin (NePilin) polynucleotide sequence. Corresponds to NCBI GenBank EFE49588.1.
Neisseria elongata subsp. glycolytica ATCC 29315 Pilin (NePilin) amino acid sequence. Corresponds to NCBI GenBank EFE49588.1. 100% identity to SEQ ID NO: 186.
Neisseria elongata subsp. glycolytica ATCC 29315 GlycoTag amino acid sequence (corresponding to residues 52-97 of SEQ ID NO: 153; 45 amino acid long).
Neisseria bacilliformis ATCC BAA-1200 (NbPilin) polynucleotide sequence. Corresponds to NCBI GenBank EGF11985.1.
Neisseria bacilliformis ATCC BAA-1200 (NbPilin) amino acid sequence. Corresponds to NCBI GenBank EGF11985.1.
Neisseria bacilliformis ATCC BAA-1200 GlycoTag amino acid sequence (corresponding to residues 57-93 of SEQ ID NO: 156; 37 amino acid long).
Neisseria mucosa ATCC 25996 (NmuPilin) polynucleotide sequence. Corresponds to NCBI GenBank EFC89512.1.
Neisseria mucosa ATCC 25996 (NmuPilin) amino acid sequence. Corresponds to NCBI GenBank EFC89512.1.
Neisseria mucosa ATCC 25996 GlycoTag amino acid sequence (corresponding to residues 52-92 of SEQ ID NO: 159; 41 amino acids long).
Neisseria shayeganii 871 (NsPilin) polynucleotide sequence. Corresponds to NCBI GenBank EGY51595.1.
Neisseria shayeganii 871 (NsPilin) amino acid sequence. Corresponds to NCBI GenBank EGY51595.1. 100% identity to SEQ ID NOs: 177 and 179.
Neisseria shayeganii 871 GlycoTag amino acid sequence (corresponding to residues 53-83 of SEQ ID NO: 162; 31 amino acids long). E.g. Gly Ala Val Thr Glu Tyr Glu Ala Asp Lys Gly Val Phe Pro Thr Ser Asn Ala Ser Ala Gly Val Ala Ala Ala Ala Asp Ile Asn Gly Lys
Neisseria shayeganii 871 GlycoTag amino acid sequence (corresponding to residues 63-74 of SEQ ID NO: 162; 12 amino acids long). E.g. Gly Val Phe Pro Thr Ser Asn Ala Ser Ala Gly Val
Neisseria lactamica ATCC 23970 Pilin amino acid sequence. Corresponds to NCBI GenBank EEZ75637.1.
Neisseria gonorrhoeae F62 Pilin amino acid sequence. Corresponds to NCBI GenBank EFF40919.1.
Neisseria cinereal ATCC 14685 Pilin amino acid sequence. Corresponds to NCBI GenBank EEZ70774.1.
Neisseria cinereal ATCC 14685 Pilin amino acid sequence. Corresponds to NCBI GenBank EEZ70775.1.
Neisseria mucosa Pilin amino acid sequence. Corresponds to NCBI GenBank KGJ31398.1.
Neisseria mucosa Pilin amino acid sequence. Corresponds to NCBI GenBank KGJ31397.1.
Neisseria flavescens NRL30031/H210 Pilin amino acid sequence. Corresponds to NCBI GenBank EEG33288.1.
Neisseria mucosa ATCC 25996 Pilin amino acid sequence. Corresponds to NCBI GenBank EFC89512.1.
Neisseria mucosa ATCC 25996 Pilin amino acid sequence. Corresponds to NCBI GenBank EFC89511.1.
Neisseria sp oral taxon 014 str. F0314 Pilin amino acid sequence. Corresponds to NCBI GenBank EFI23295.1.
Neisseria sp oral taxon 014 str. F0314 Pilin amino acid sequence. Corresponds to NCBI GenBank EFI23294.1.
Neisseria arctica Pilin amino acid sequence. Corresponds to NCBI GenBank KLT73057.1.
Neisseria shayeganii 871 Pilin amino acid sequence. Corresponds to NCBI GenBank EGY51595.1. 100% identity to SEQ ID NOs: 162 and 179.
Neisseria shayeganii 871 Pilin amino acid sequence. Corresponds to NCBI GenBank. EGY51594 (=ID 180)
Neisseria shayeganii 871 Pilin amino acid sequence. Corresponds to NCBI GenBank EGY51595.1. 100% identity to SEQ ID NOs: 162 and 177.
Neisseria shayeganii 871 Pilin amino acid sequence. Corresponds to NCBI GenBank EGY51594.1.
Neisseria sp. 83E34 Pilin amino acid sequence. Corresponds to NCBI GenBank KPN71218.1.
Neisseria sp. 83E34 Pilin amino acid sequence. Corresponds to NCBI GenBank KPN71186.1.
Neisseria wadsworthii 9715 Pilin amino acid sequence. Corresponds to NCBI GenBank EGZ51246.1.
Neisseria wadsworthii 9715 Pilin amino acid sequence. Corresponds to NCBI GenBank EGZ51247.1.
Neisseria elongata subsp. glycolytica ATCC 29315 Pilin amino acid sequence. Corresponds to NCBI GenBank EFE49587.1.
Neisseria elongata subsp. glycolytica ATCC 29315 Pilin amino acid sequence. Corresponds to NCBI GenBank EFE49588.1. 100% identity to SEQ ID NO: 153.
Neisseria bacilliformis ATCC BAA-1200 Pilin amino acid sequence. Corresponds to NCBI GenBank EGF04823.1.
Neisseria bacilliformis ATCC BAA-1200 Pilin amino acid sequence. Corresponds to NCBI GenBank EGF11985.1.
Neisseria bacilliformis ATCC BAA-1200 Pilin amino acid sequence. Corresponds to NCBI GenBank EGF12096.1.
Neisseria sp. oral taxon 020 str. F0370 Pilin amino acid sequence. Corresponds to NCBI GenBank EKY04118.1.
Neisseria sp. oral taxon 020 str. F0370 Pilin amino acid sequence. Corresponds to NCBI GenBank EKY04120.1.
Neisseria sp. 74A18 Pilin amino acid sequence. Corresponds to NCBI GenBank KPN73545.1.
Neisseria sp. 74A18 Pilin amino acid sequence. Corresponds to NCBI GenBank KPN73546.1.
Neisseria weaver ATCC 51223 Pilin amino acid sequence. Corresponds to NCBI GenBank EGV37979.1.
Neisseria macacae ATCC 33926 Pilin amino acid sequence. Corresponds to NCBI GenBank EGQ74605.1.
Neisseria macacae ATCC 33926 Pilin amino acid sequence. Corresponds to NCBI GenBank EGQ74606.1.
AcrA polynucleotide sequence (including pelB signal sequence).
MKYLLPTAAAGLLLLAAQPAMAMHMSKEEAPKIQMP
MKYLLPTAAAGLLLLAAQPAMAMHMSKEEAPKIQMP
SAVTEYYLNHGEWPGNNTSAGVATSSEIK
HHHHHH
PcrV polynucleotide sequence (including LtIIb signal sequence).
MSFKKIIKAFVIMAALVSVQAHAAEVRNLNAARELFLDELLAASAAPAS
SSEIKHHHHHH
Crm197 polynucleotide sequence (including DsbA signal sequence and GlycoTag sequence SEQ ID NO: 140 at C-terminus).
MKKIWLALAGLVLAFSASAGADDVVDSSKSFVMENFSSYHGT
SSEIK
HHHHHH
MKKIWLALAGLVLAFSASAGADDVVDSSKSFVMENFSSYHGTKPGYVDS
m2Crm197 polynucleotide sequence (including DsbA signal sequence and GlycoTag sequence SEQ ID NO: 140 at N-terminus and C-terminus)
MKKIWLALAGLVLAFSASA
SAVTEYYLNHGEWPGNNT
SAGVATSSEIKGADDVVDSSKSFVMENFSSY
IK
HHHHHH
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgT amino acid sequence corresponding to NCBI GenBank Accession No. AAG17408.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgU amino acid sequence corresponding to NCBI GenBank Accession No. AAG17409.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen Wzx amino acid sequence corresponding to NCBI GenBank Accession No. AAG17410.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen Wzy amino acid sequence corresponding to NCBI GenBank Accession No. AAG17411.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgV amino acid sequence corresponding to NCBI GenBank Accession No. AAG17412.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgW amino acid sequence corresponding to NCBI GenBank Accession No. AAG17413.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgX amino acid sequence corresponding to NCBI GenBank Accession No. AAG17414.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgY amino acid sequence corresponding to NCBI GenBank Accession No. AAG17415.1. [102]
Plesiomonas shigelloides O17 (i.e., Shigella sonnei) O-antigen WbgZ amino acid sequence corresponding to NCBI GenBank Accession No. AAG17416.1. [102]
Number | Date | Country | Kind |
---|---|---|---|
18212100.4 | Dec 2018 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2019/084632 | 12/11/2019 | WO | 00 |