The instant application contains an electronically submitted Sequence Listing in ASCII text file format (Name: VB66226WO_SeqLstg_ST25.txt; Size: 195,491 bytes; and Date of Creation: Apr. 2, 2018) which is hereby incorporated by reference in its entirety.
An inventor, the inventors, or another who obtained the subject matter from the inventor(s), have disclosed Chandramouli et al., 2017 Science Immunology 2(12): eaan1457 and Protein Data Bank (PDB) ID 5VOB.
This invention is in the field of vaccination against human cytomegalovirus (HCMV) and in particular provides mutant gH, gL, pUL128, pUL130, and pUL131A polypeptides as well as a stabilized HCMV complex comprising two or more of gH, gL, pUL128, pUL130, and pUL131A polypeptides wherein at least one of the two polypeptides is mutant, and their use in an immunogenic composition or vaccine composition.
Cytomegalovirus (CMV) is a genus of virus that belongs to the viral family known as Herpesviridae or herpesviruses. The species that infects humans is commonly known as HCMV or human herpesvirus-5 (HHV-5). Within Herpesviridae, HCMV belongs to the Betaherpesvirinae subfamily, which also includes cytomegaloviruses from other mammals.
Although they may be found throughout the body, HCMV infections are frequently associated with the salivary glands. HCMV infects between 50% and 80% of adults in the United States (40% worldwide), as indicated by the presence of antibodies in much of the general population. HCMV infection is typically unnoticed in healthy people, but can be life-threatening for the immunocompromised, such as HIV-infected persons, organ transplant recipients, or new born infants (Mocarski et al., Cytomegalovirus in F
HCMV seems to have a large impact on immune parameters in later life and may contribute to increased morbidity and eventual mortality (Simanek et al., Seropositivity to Cytomegalovirus, Inflammation, All-Cause and Cardiovascular Disease-Related Mortality in the United States, 2011 PLoS ONE 6: e16103).
To date, the genomes of over 20 different HCMV strains have been sequenced, including those of both laboratory strains and clinical isolates. For example, the following strains of HCMV have been sequenced: Towne (NCBI GenInfo (GI) identifier 239909366), AD169 (GI:219879600), Toledo (GI:290564358) and Merlin (GI:155573956). HCMV strains AD169, Towne and Merlin can be obtained from the American Type Culture Collection (ATCC VR538, ATCC VR977 and ATCC VR1590, respectively).
HCMV contains an unknown number of membrane protein complexes. Of the known glycoproteins in the viral envelope, gH and gL have emerged as particularly interesting due to their presence in several different complexes: dimeric gH/gL, trimeric gH/gL/gO (also known as the gCIII complex) and the pentameric gH/gL/pUL128/pUL130/pUL131A (the latter protein is also referred to as pUL131). HCMV is thought to use the pentameric complexes to enter epithelial and endothelial cells by endocytosis and low-pH-dependent fusion but it is thought to enter fibroblasts by direct fusion at the plasma membrane in a process involving gH/gL or possibly gH/gL/gO. The gH/gL and/or gH/gL/gO complex(es) is/are sufficient for fibroblast infection, whereas the pentameric complex is required to infect endothelial and epithelial cells (Ryckman, B J, et al., Characterization of the Human Cytomegalovirus gH/gL/UL128-131 Complex That Mediates Entry into Epithelial and Endothelial Cells, 2008 J. Virol. 82: 60-70).
The pentameric complex is considered as a major target for CMV vaccination. Viral genes UL128, UL130 and UL131 are needed for endothelial entry (Hahn et al., Human Cytomegalovirus UL131-128 Genes are Indispensable for Virus Growth in Endothelial Cells and Virus Transfer of Leukocytes, 2004 Journal of Virology 78(18): 10023-10033). Fibroblast-adapted non-endothelial tropic strains contain mutations in at least one of these three genes. Towne strain, for example, contains a two base pair insertion causing a frame shift in UL130 gene, whereas AD169 contains a one base pair insertion in UL131 gene. Both Towne and AD169 could be adapted for growth in endothelial cells, and in both instances, the frame shift mutations in UL130 or UL131 genes were repaired.
Genini et al. (Serum antibody response to the gH/gL/pUL128-131 five-protein complex of human cytomegalovirus (HCMV) in primary and reactivated HCMV infections, 2011 J. Clin. Vir. 52: 113-118.) discloses a serum antibody response to the pentameric complex of HCMV in primary and reactivated HCMV infections. The response was determined by both indirect immunofluorescence (IFA) and ELISA, using fixed or lysed epithelial (ARPE-19) cells infected with one or more adenoviral vectors, each carrying one HCMV gene and, in parallel, with a control adenovirus vector. The specificity of results was determined by the reactivity of human neutralizing monoclonal antibodies recognizing two, three, or four proteins of the complex. In 14 cases of primary infection, an IgG antibody seroconversion to the UL128-131 gene product was consistently detected within 2-4 weeks after onset of infection, while antibodies persisted for at least 12 months. The IgG antibody response to UL128-131 gene products was generally superior to the response to gH and appeared to follow the neutralizing antibody response (as determined in epithelial cells). In reactivated infections, the antibody response showed a trend reminiscent of a booster response. IgG antibodies were detected in HCMV-seropositive healthy adult controls, but not in HCMV-seronegative individuals.
Kinzler et al. (Expression and reconstitution of the gH/gL/gO complex of human cytomegalovirus, 2002 J. Clin. Vir. 25(Supp.2): 87-95) co-expressed gH, gL, and gO in insect cells using a recombinant baculovirus, but were unable to produce the gH/gL/gO tripartite complex. Instead, only gH/gL heterodimers, gH/gL heteromultimers, and gO homomultimers were detected. In contrast, co-expression of gH, gL, and gO in mammalian cells produced high molecular weight complexes that closely resemble gH/gL/gO complexes formed in HCMV infected cells. Cell surface immunofluorescence showed that these complexes are expressed and displayed on the surface of transfected cells.
U.S. Pat. No. 7,704,510 discloses that pUL131A is required for epithelial cell tropism; that pUL128 and pUL130 form a complex with gH/gL, which is incorporated into virions, and that this complex is required to infect endothelial and epithelial cells but not fibroblasts. Also, anti-CD46 antibodies were found to inhibit HCMV infection of epithelial cells.
WO 2014/005959 (also published as U.S. Pre-grant Pub. No. 2016-0159864) discloses a purified HCMV pentameric complex comprising the polypeptides gH, gL, pUL128, pUL130 and pUL131A and its use in an immunogenic composition or a vaccine.
Generally, there exists an inverse relationship between the flexibility of a protein from a mesophilic organism and the thermostability of that protein as was recently shown for the Lipase A enzyme from the mesophilic organism Bacillus subtilis (see Rathi et al., Structural rigidity and protein thermostability in variants of Lipase A from Bacillus subtilis, 2015 PLOS ONE 19(7): e0130289; DOI: 10.1371/journal.pone.0130289; 24 pages). Increased stability of antigens has been previously linked with improved immunogenicity such as, for example, for the pre-fusion conformation of the respiratory syncytial virus fusion protein (McLellan et al., Structure-Based Design of a Fusion Glycoprotein Vaccine for Respiratory Syncytial Virus, 2013 Science 342(6158): 592-598.) and the Neisseria meningitidis factor H binding protein (fHbp) (Rossi et al., Meningococcal Factor H Binding Protein Vaccine Antigens with Increased Thermal Stability and Decreased Binding of Human Factor H, 2016 Infect. Immun. 84(6): 1735-1742.).
It is expected that an improved thermostability of an HCMV complex such as the pentamer complex will have the following advantages: (i) facilitate its preparation and production, and (ii) have an impact on its use as an antigen, providing a better immunogenicity. Therefore, there is a need for developing an HCMV complex, specifically a pentameric complex, having an enhanced thermostability for suitable use in an immunogenic composition or a vaccine. Further, a glycan in close proximity to a neutralizing epitope on an HCMV complex is believed to limit the accessibility of the epitope. Therefore, there is a need for developing a deglycosylated HCMV complex, specifically a pentameric complex, having more accessible epitopes (optionally also having an enhanced thermostability) that is suitable for use in an immunogenic composition or vaccine.
This invention is based on the recombinant expression of a stabilized HCMV complex (e.g., a gH/gL, gH/gL/gO, or gH/gL/UL128/UL130/pUL131A complex) comprising two or more of gH, gL, pUL128, pUL130, and pUL131A polypeptides wherein at least one of the two polypeptides is mutant, and its use in an immunogenic composition or vaccine composition. Methods of making and using them are provided. The mutant polypeptides and the nucleic acid molecules encoding them, as well as antibodies, expression vectors, and host cells are also provided.
In one aspect of the invention, there is provided an HCMV gH polypeptide, or a complex-forming fragment thereof, comprising one or more stabilizing mutations. In another aspect of the invention, there is provided an HCMV gL polypeptide, or a complex-forming fragment thereof, comprising one or more stabilizing mutations. In a further aspect of the invention, there is provided an HCMV pUL128 polypeptide, or a complex-forming fragment thereof, comprising one or more stabilizing mutations. In a further aspect of the invention, there is provided an HCMV pUL130 polypeptide, or a complex-forming fragment thereof, comprising one or more stabilizing mutations. In a further aspect of the invention, there is provided an HCMV pUL131A polypeptide, or a complex-forming fragment thereof, comprising one or more stabilizing mutations.
In a further aspect of the invention, there is provided a pentameric complex of HCMV polypeptides comprising a gH polypeptide, a gL polypeptide, a pUL128 polypeptide, a pUL130 polypeptide and a pUL131A polypeptide, wherein at least one polypeptide comprises one or more amino acid stabilizing mutations. In a further aspect of the invention, that pentameric complex has an increased stability as compared to a non-mutant pentameric complex. In a further aspect of the invention, there is provided an immunogenic composition comprising that pentameric complex of HCMV polypeptides.
In a further aspect of the invention, there is provided a gH/gL complex of HCMV polypeptides comprising a gH polypeptide and a gL polypeptide wherein at least one of the polypeptides comprises one or more stabilizing mutations. In a further aspect of the invention, that gH/gL complex has an increased stability as compared to a non-mutant gH/gL complex. In a further aspect of the invention, there is provided an immunogenic composition comprising that gH/gL complex of HCMV polypeptides.
In a further aspect of the invention, there is provided a gH/gL/gO complex of HCMV polypeptides comprising a gH polypeptide, a gL polypeptide, and a gO polypeptide, wherein at least one of the gH and gL polypeptides comprise one or more amino acid stabilizing mutations. In a further aspect of the invention, that gH/gL/gO complex has an increased stability as compared to a non-mutant gH/gL/gO complex. In a further aspect of the invention, there is provided an immunogenic composition comprising that gH/gL/gO complex of HCMV polypeptides.
In a further aspect of the invention, the complexes of the invention are produced at high yields. For example, in processes involving growing host cells of the invention in growth medium, the protein complex of the invention may accumulate to a level of more than 0.4 mg per litre of growth medium (e.g. 0.45, 0.5, 0.55, 0.6, 0.65, 0.7, 0.75, 0.8, 0.85, 0.9, 0.95, 1.0, 1.2, 1.4, 1.6, 1.8, 2, 2.5, 3, 3.5, 4, 4.5 or 5 mg per litre of growth medium or more).
The present invention provides the following:
Revealed herein is the crystal structure of the HCMV pentameric complex made of the polypeptides gH, gL, pUL128, pUL130 and pUL131A in combination with fragment antigen binding (Fabs) of monoclonal antibodies (mAbs). The inventors analysed the crystal structure and characterize (i) the presence of small interfaces between some of the domains of the complex, (ii) the presence of several cavities at the domain interfaces, and (iii) an intrinsic flexibility or dynamics of the complex. The inventors also characterize and herein describe polypeptide modifications (i.e., modifications of polypeptides gH, gL, pUL128, pUL130, and/or pUL131A) that enhance the thermostability of the pentameric complex. In particular, the inventors provide specific amino acids in each of the polypeptides of the HCMV pentameric complex that, when modified as described herein, enhance the thermostability of a complex comprising them. Without wishing to be bound by theory, it is believed that an amino acid modification of this invention enhances complex thermostability by decreasing the conformational flexibility of the complex. This is believed to occur because, for example, (A) a mutant amino acid of this invention (the amino acid resulting from a presently described stabilizing mutation) has longer side-chains than the original amino acid (the amino acid which is innately present within the referenced, non-mutant protein sequence) and the atoms of the modified amino acid's side-chains fill buried cavities between domain interfaces and/or protein core cavities. In this way, the structural characteristic of a mutant amino acid having long side-chains effects the filling of buried cavities between domain interfaces and/or protein core cavities and results in enhanced complex stability. Such mutations may be referred to herein as “cavity-filling mutations” with the resulting mutant amino acid being referred to as a “cavity-filling mutant.” It is further believed that the amino acid modifications of this invention enhance complex thermostability by (B) “repacking” the complex via increasing contacts of neighboring residues and/or replacing unfavorable clusters of charged residues within a protein or between proteins. Such mutations may be referred to herein as “repacking mutations”, with the resulting amino acid being referred to as a “repacking mutant.” Specific repacking mutations may be referred to herein as “hydrophobic mutations” or “hydrophilic mutations”, with the resulting mutant amino acid being referred to as a “hydrophobic mutant” or “hydrophilic mutant,” respectively. In this way, the structural characteristic of a referenced mutant amino acid being hydrophobic or hydrophilic effects an increase in contacts with its neighboring residues and/or effects the reduction of unfavorable clusters of charged residues and results in enhanced complex stability. Furthermore, it is believed that the amino acid modifications of this invention enhance complex thermostability by (C) introducing disulfide bridges throughout the complex. This is believed to occur because the newly introduced disulfide bridges lock (restrain) the polypeptides and thereby reduce their dynamics. In this way, the structural characteristic of a mutant amino acid being a cysteine or otherwise a residue structurally capable of forming disulfide bridges effects the introduction of a disulfide bridge and results in enhanced complex stability. Such mutations may be referred to herein as “disulfide bridge mutations” with the resulting amino acid being referred to as a “disulfide bridge mutation.” In this way, the present invention provides modified HCMV pentameric complex proteins (gH, gL, pUL128, pUL130, and pUL131A), optionally within a HCMV complex (e.g., a pentameric, gH/gL, or gH/gL/gO complex), wherein the one or more mutant amino acid results in at least one of A-C and thereby an enhanced complex thermostability. From having analysed the HCMV pentamer complex crystal structure and its neutralizing epitopes, the inventors have selected additional glycans which are in close proximity to a neutralizing epitope and that are likewise expected to limit the accessibility of their respective HCMV pentamer epitope(s). The inventors therefore expect that by introducing one or more of the deglycosylation mutations into a HCMV complex, the corresponding epitope(s) will be more accessible as compared to those of a non-mutant. In particular, it is believed that removing one or more of the identified glycan(s) will “unmask” the corresponding epitope and increase antigenicity. The inventors specifically propose the substitution of an identified asparagine residue for any non-asparagine amino acid (e.g., glutamine) using known techniques so as to prevent N-linked glycosylation at that location and thereby unmask the corresponding epitope. In this way, the present invention provides modified HCMV pentameric complex proteins (gH, gL, pUL128, pUL130, and pUL131A), optionally within a HCMV complex (e.g., a pentameric, gH/gL, or gH/gL/gO complex), wherein the one or more mutant amino acid results in deglycosylation. Such deglycosylation may unmask an epitope, making the epitope more accessible to, for example, a neutralizing antibody or antibody fragment.
The polypeptides of the invention are mutant and therefore have a distinct structure as compared to a non-mutant (e.g., wild type or control) polypeptide. Likewise, therefore, a complex comprising a polypeptide of the invention has a distinct structure as compared to a non-mutant complex (e.g., wild type or control complex) because a complex of the invention comprises at least one mutant HCMV gH, gL, UL128, UL130, and pUL131A polypeptide. Further, the complex of the invention may have a distinct function as compared to a non-mutant (e.g., wild type or control) complex in that the complex of the invention may have an increased thermostability as compared to a non-mutant complex.
It is therefore an object of the invention to provide HCMV gH, gL, pUL128, pUL130 and pUL131A polypeptides, as defined herein, presenting the advantage of forming a complex (e.g., a pentameric, gH/gL, or gH/gL/gO complex) having an enhanced thermostability and/or more accessible epitopes. Although the present invention is applicable to polypeptides originating from any HCMV strain, in order to facilitate its understanding, when referring to amino acid positions in the present specification, the numbering is given in relation to the amino acid sequence of the polypeptides originating from the Merlin strain, unless otherwise stated. In particular, gH amino acid positions are given with respect to the Merlin gH sequence SEQ ID NO: 1; gL amino acid positions are given with respect to the Merlin gL sequence SEQ ID NO: 7; pUL128 amino acid positions are given with respect to the Merlin pUL128 sequence 13; pUL130 amino acid positions are given with respect to the Merlin pUL130 sequence SEQ ID NO: 17; pUL131A amino acid positions are given with respect to the Merlin pUL131A sequence SEQ ID NO: 21; and gO amino acid positions are given with respect to the Merlin gO sequence SEQ ID NO: 25 unless otherwise stated. The present invention is not, however, limited to the Merlin strain. Using the teachings of the invention, comparable amino acid positions in a polypeptide of any other HCMV strain can be determined easily by those of ordinary skill in the art by aligning the amino acid sequences using readily available and alignment algorithms (such as BLAST, using default settings, ClustalW2, using default settings, or algorithm disclosed by Corpet (Multiple Sequence Alignment with Hierarchical Clustering, 1998 Nucleic Acids Research 16(22): 10881-10890), using default parameters). Accordingly, when referring to an “HCMV polypeptide”, it is to be understood as an HCMV polypeptide from any strain (in addition to Merlin strain). The actual number of the amino acid residue positions may have to be adjusted for polypeptides from other strains depending on the actual sequence alignment.
A location within a sequence, position within a sequence, residue, or amino acid “corresponding to residue ## of SEQ ID NO: ###” or “that corresponds to residue ## of SEQ ID NO: ###” or “with respect to residue ## of SEQ ID NO: ###” or “at the position corresponding to ‘X’ of SEQ ID NO: ###” or “at the location that corresponds to the location of residue ## of SEQ ID NO: ###” or “numbered with respect to residue ## of SEQ ID NO: ###” are exemplary phrases used in the sense of the present invention for clarity, particularly within claims, to provide a means by which a particular position within an amino acid sequence is identified. Further, these phrases are used to encompass, for example, amino acid sequences that encode a polypeptide having the same or similar function as the protein having the referenced amino acid sequence SEQ ID NO: ### but that have a disparate structure (i.e., the encompassed polypeptide has the same or similar function as the referenced protein but the encompassed polypeptide and referenced protein have different amino acid sequences). By way of example, a polypeptide comprising a mutation at “the amino acid residue corresponding to P45 of the pUL131A sequence SEQ ID NO: 21” would encompass at least a polypeptide having a mutation at P45 of the pUL131A sequence SEQ ID NO: 21 (Merlin strain), a polypeptide having a mutation at P45 of the pUL131A sequence SEQ ID NO: 23 (Towne strain), and a mutation at P49 of the pUL131A sequence SEQ ID NO: 24 (AD169 strain) (all of which are underlined in the following multiple sequence alignment of SEQ ID NOs: 21, 23, and 24; signal sequence residues are also underlined).
MRLCRVWLSV CLCAVVLGQC QRETAEKNDY YRVPHYWDAC SR....ALPD
MRLCRVWLSV CLCAVVLGQC QRETAEKNDY YRVPHYWDAC SR....ALPD
MRLCRVWLSV CLCAVVLGQC QRETAEKKRL LPSTALLGRV LSRAARPNPL
By “HCMV complex” or simply “complex”, it is meant in the sense of the present invention a group of two or more associated proteins that comprises at least one of the HCMV polypeptides gH, gL, pUL128, pUL130, pUL131A, or complex-forming fragments thereof. An HCMV complex or “complex” in the sense of the present invention includes, for example, gH/UL115, gH/gL (e.g., the gH/gL heterodimer or a gH/gL dimer of heterodimers), gH/gL/gO, and pentameric complexes. A “mutant HCMV complex” or simply “mutant complex” in the sense of the present invention is a group of two or more associated proteins that comprise at least one of the HCMV polypeptides gH, gL, pUL128, pUL130, pUL131A, or complex-forming fragments thereof wherein the at least one HCMV polypeptide or complex-forming fragment thereof comprises one or more stabilizing mutant amino acid residues.
By “pentameric complex”, it is meant in the sense of the present invention an HCMV complex that comprises five different HCMV polypeptides: gH, gL, pUL128, pUL130 and pUL131A. Although generally referred to as gH/gL/pUL128/pUL130/pUL131A pentamer (or pentameric complex comprising gH, gL, pUL128, pUL130 and pUL131A) in the specification, each of the 5 polypeptides does not need to be a full-length polypeptide. The term “pentameric complex” also encompasses pentamers formed by complex-forming fragments of gH, gL, pUL128, pUL130 and pUL131A polypeptides.
The term “complex subprotein” or simply “subprotein” may be used in the sense of the present invention to refer to the HCMV polypeptides that are present within the referenced HCMV complex. For example, a “subprotein” of the HCMV pentameric complex means any one of gH, gL, pUL128, pUL130, and pUL131A and in this example is a synonym for a “HCMV pentameric complex protein.” In further example, a “gH/gL complex subprotein” means a gH or gL polypeptide.
By “complex-forming fragment” of an HCMV polypeptide, it is meant in the sense of the present invention any part or portion of the polypeptide that retains the ability to form a complex (e.g., the pentameric complex, gH/gL dimer, and gH/gL/gO trimer) with other HCMV polypeptides of the complex. As used herein, a “complex-forming fragment” of a mutant polypeptide comprises the one or more mutant amino acid residues (i.e., the fragment of a mutant protein comprises the mutation(s)). The ability to form a complex (e.g., a pentameric complex) of the invention can be tested by performing protein purification, and analyzing the proteins by non-reducing PAGE, Western blot and/or size exclusion chromatography. If the proteins form part of a complex, they may all be present in a single band on a native PAGE gel and/or be present in a single peak in a size exclusion chromatogram.
By “enhanced thermo-stability” or “enhanced thermostability” or “higher thermostability” or “increased thermostability” or simply “enhanced stability” or “higher stability” or “increased stability”, it is meant in the sense of the present invention that the complex (e.g., the pentameric complex) has at least a lower rate of unfolding, under the same conditions, than the same complex which does not comprise any stabilizing mutation (said another way, the complex unfolds slower than a non-mutant complex under the same conditions). “Conditions” as used herein includes, for example, experimental and physiological conditions. It may be specified that a composition comprising a stabilized complex of the present invention has an increased shelf life as compared to a composition comprising a non-mutant complex. See, e.g., U.S. Pub. No. 2011/0229507 A1, hereby incorporated by reference in its entirety; Clapp et al., Vaccines with Aluminum-Containing Adjuvants: Optimizing Vaccine Efficacy and Thermal Stability, 2011 J. Pharm. Sci. 100(2): 388-401, discussing increased stability via adjuvants and assessing antigen stability in altered pH, hydration, and temperature conditions; and Rossi et al., 2016 Infect. Immun. 84(6): 1735-1742. Stability herein may be provided by the delta stability (dStability or dS) scoring method, which is the computationally-determined difference between the relative thermostability of an in-silico mutant protein and that of the corresponding wild type or non-mutant protein. Methods of determining dStability are provided at Example 3 and include, for example, known tools such as Molecular Operating Environment (MOE) software (REF: Molecular Operating Environment (MOE) software; Chemical Computing Group Inc., available at WorldWideWeb(www).chemcomp.com). dS is measured by kcal/mol. Lower dS values indicate higher protein stability, inversely higher dS values indicate lower protein stability. It may be specified that the mutant polypeptides of the present invention have a higher relative thermostability (in kcal/mol) as compared to a non-mutant polypeptide under the same experimental conditions. It may be further specified that the mutant polypeptides of the present invention have a lower dS value than a non-mutant polypeptide under the same experimental conditions. It will be understood from the present invention that a mutant polypeptide having a lower dS value as compared to a non-mutant polypeptide under the same experimental conditions is more stable than the non-mutant polypeptide. The stability enhancement can be assessed using differential scanning calorimetry (DSC), for example as discussed in Bruylants et al. (Differential Scanning Calorimetry in Life Sciences: Thermodynamics, Stability, Molecular Recognition and Application in Drug Design, 2005 Curr. Med. Chem. 12: 2011-2020) and Calorimetry Sciences Corporation's “Characterizing Protein stability by DSC” (Life Sciences Application Note, Doc. No. 2021102136 February 2006) or by differential scanning fluorimetry (DSF). An increase in stability may be characterized as an at least about 2° C. increase in thermal transition midpoint (Tm), as assessed by DSC or DSF. See, for example, Thomas et al., Effect of single-point mutations on the stability and immunogenicity of a recombinant ricin A chain subunit vaccine antigen, 2013 Hum. Vaccin. Immunother. 9(4): 744-752. A “significant” increase in, or enhancement of, thermostability is defined as an increase of at least about 5° C. in the calculated Tm of a complex (calculated by, for example, the protocol provided at Example 4.7).
By “mutation”, it is meant in the sense of the present invention a substitution of an amino acid residue with another amino acid residue. With respect to the nucleic acid sequence, this substitution is effected via a missense mutation within the corresponding codon of the coding region (the mutant polypeptide encoded by one such mutant nucleic acid sequence may be referred to as a “mutein”). The term “mutation” as used herein also includes modifications that introduce a non-naturally occurring amino acid or an amino acid analog into a polypeptide. A “mutation” as used herein does not include an “identical mutation,” which is the substitution of an amino acid residue with a natural or synthetically produced amino acid having the same chemical structure. By way of example, the substitution of alanine at position 102 of the sequence SEQ ID NO: 1 with an alanine (A102A) is an “identical mutation” as used herein and is not within the meaning of “mutation” in the sense of the present invention. Therefore, the mutations of the present invention may be clarified with the proviso that an identical mutation is excluded.
By “stabilizing mutations” or “stabilizing mutant”, it is meant in the sense of the present invention any mutation in the HCMV polypeptides resulting, upon formation of the pentameric complex, into a complex having an enhanced thermo-stability, as compared with a pentameric complex formed with the HCMV polypeptides containing no such mutation. Stabilizing mutations in the sense of the present invention encompass, in particular, cavity-filling mutations, repacking mutations (which includes hydrophobic mutations and hydrophilic mutations), and disulfide bridge mutations. A stabilizing mutation of the present invention is not detrimental to the use of the mutated protein as an antigen. In particular, the stabilizing mutation does not prohibit all epitopes that can elicit the production of antibodies that can bind to at least a HCMV complex as described herein and/or antibodies that can neutralize the biological effects of said HCMV complex. In addition, a stabilizing mutation of the present invention does not prevent the mutant polypeptides to form a complex. By “stabilized complex”, it is meant in the sense of the present invention a complex (e.g., a pentameric complex) comprising at least one polypeptide that comprises at least one stabilizing mutation.
The terms “amino acid”, “residue”, and “amino acid residue” as used herein all refer to an organic chemical compound that contains at least one amino group (—NH2) and one carboxyl group (—COOH). For clarity, however, and particularly within the claims. the amino acid present at a particular position within a wild type or non-mutant sequence (e.g., the wild type amino acid) may be referred to as an “amino acid” whereas the compound present at that corresponding position within the mutant sequence (the mutant amino acid) may be referred to as a “residue” (i.e., the amino acid resulting from a missense mutation may be referred to as a “residue”).
By “non-mutant”, it is meant in the sense of the present invention that the referenced molecule (e.g., the sequence, polypeptide, or complex) does not comprise a stabilizing mutation or deglycosylation mutation of the present invention. In this way, the term “non-mutant” encompasses “wild type” structure but it also encompasses, for example, a truncated wild type polypeptide. For example, an HCMV gH polypeptide having the amino acid sequence of SEQ ID NO: 3 is a “non-mutant” polypeptide at least because it is truncated and therefore does not have “wild type” HCMV gH structure. Further by example, an HCMV gL polypeptide having the amino acid sequence of SEQ ID NO: 9 or SEQ ID NO: 29 is a “non-mutant” polypeptide in the sense of the present invention. To be clear, “non-mutant” is used herein as a reference to structure and not as a reference to function. A non-mutant polypeptide, for example, may be described as having wild type function (such as the HCMV gH polypeptide comprising SEQ ID NO: 3).
By “cavity-filling mutation”, it is meant in the sense of the present invention the substitution of a first amino acid (e.g., a wild type amino acid) with a second amino acid wherein the second amino acid has a longer side chain than the first amino acid. Said another way, a cavity-filling mutation herein is the substitution of an amino acid with a residue wherein the residue has a longer side chain than the amino acid. Such amino acid residues having a long side chain include: tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
By “repacking mutations”, it is meant in the sense of the present invention the substitution of a first amino acid residue (e.g., a wild type amino acid) with a second amino acid residue which increases the interaction of neighboring residues in a polypeptide. Repacking mutations include amino acid substitutions that, for example, (1) enhance hydrophobic interactions (e.g., through hydrophobic amino acids) or hydrogen-bond formation (e.g., through polar or charged (i.e., hydrophilic) amino acids), or (2) reduce unfavourable or repulsive interactions of neighboring residues (e.g., by eliminating clusters of similarly charged residues). A repacking mutation in the sense of the present invention encompasses, in particular, hydrophobic mutations and hydrophilic mutations.
By “hydrophobic mutation”, it is meant in the sense of the present invention the substitution of a first amino acid residue (e.g., a wild type amino acid) with a second amino acid residue wherein the second amino acid has a hydrophobic side chain. Such amino acid residues having a hydrophobic side chain include: tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P). Tyrosine (Y) may also be classified as hydrophobic.
By “hydrophilic mutation”, it is meant in the sense of the present invention the substitution of a first residue (e.g., a wild type amino acid residue) with a second amino acid wherein the second amino acid has a hydrophilic side chain. “Hydrophilic side chain” encompasses both a substitution to an amino acid having a polar side chain and a substitution to an amino acid having a charged side chain. It is generally understood in the art that polar amino acid side chains are hydrophilic but are not charged and charged amino acid side chains are hydrophilic and are charged. By “polar mutation”, it is meant in the sense of the present invention the substitution of a first amino acid residue (e.g., a wild type amino acid) with a second amino acid residue wherein the second amino acid has a polar side chain. Amino acid residues having a polar side chain include: serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N) and glutamine (Q). By “charged mutation”, it is meant in the sense of the present invention the substitution of a first amino acid residue (e.g., a wild type amino acid) with a second amino acid residue wherein the second amino acid has a charged side chain. Amino acid residues having a charged side chain include: arginine (R), glutamic acid (E), lysine (K), histidine (H), and aspartic acid (D).
By “disulfide bridge mutations”, it is meant in the sense of the present invention the substitution of an amino acid residue with a cysteine (C) residue, so as to form disulfide bridges within a polypeptide or between polypeptides. The present invention includes individual proteins (e.g., gH, gL, pUL128, pUL130, and pUL131A) consisting of one disulfide bridge mutation (i.e., the substitution of one amino acid with cysteine). Disulfide bridges are formed between two residues and are either an “intra-disulfide bridge” (formed between two residues within the same polypeptide) or an “inter-disulfide bridge” (formed between two residues, the first residue being within a first polypeptide and the second residue being within a second polypeptide). Therefore, to increase the stability of a complex of proteins using only disulfide bridge mutations, the disulfide bridge mutations must be introduced into the complex in pairs (multiples of 2) wherein each of the at least one pair of disulfide bridge mutations is within one or more of the complex subproteins. It may be specified that the complex comprises 2n disulfide bridge mutations wherein “n” is any positive integer including zero. It may be further specified that the complex comprises two, four, six, eight, ten, twelve, fourteen, sixteen, eighteen, twenty, twenty-two, twenty-four, twenty-six, twenty-eight, thirty, thirty-two, thirty-four, thirty-six, thirty-eight, forty, forty-two, forty-four, forty-six, or a higher multiple of 2, disulfide bridge mutations. For example, the present invention encompasses a stabilized pentameric complex consisting of one pair of disulfide bridge mutations wherein the first disulfide bridge mutation is at the residue corresponding to G116C of pUL130 having the sequence SEQ ID NO: 17 and the second disulfide bridge mutation is at the residue corresponding to H150C of pUL130 having the sequence SEQ ID NO: 17 (see Table 21 below). Further for example, the present invention encompasses a stabilized pentameric complex consisting of one pair of disulfide bridge mutations wherein the first disulfide bridge mutation is at the residue corresponding to D163C of gL having the sequence SEQ ID NO: 7 and the second disulfide bridge mutation is at the residue corresponding to P62C of pUL130 having the sequence SEQ ID NO: 17 (see Table 21 below). Further for example, the present invention includes a stabilized pentameric complex consisting of two pairs of disulfide bridge mutations wherein the first pair of disulfide bridge mutations consists of a first disulfide bridge mutation at the residue corresponding to D163C of gL having the sequence SEQ ID NO: 7 and a second disulfide bridge mutation at the residue corresponding to P62C of pUL130 having the sequence SEQ ID NO: 17 and wherein the second pair of disulfide bridge mutations consists of a third disulfide bridge mutation at the residue corresponding to V109C of gH having the sequence SEQ ID NO: 1 and a fourth disulfide bridge mutation at the residue corresponding to G224C of gL having the sequence SEQ ID NO: 7 (see Table 21 below). For a stabilized complex of the present invention comprising, for example, one repacking mutation and one disulfide bridge mutation, a person with ordinary skill in the art will recognize that it is not necessary to specify a pair of disulfide bridge mutations because the one repacking mutation alone may be sufficient to effect increased stability.
By “deglycosylation mutation(s)”, it is meant in the sense of the present invention the substitution of an asparagine amino acid (e.g., a wild type asparagine amino acid) with a second amino acid which thereby prevents the addition of a glycan to an asparagine nitrogen atom at that residue location (i.e., prevents N-linked glycosylation at that location). The second amino acid may be any non-asparagine residue. It may therefore be specified that a polypeptide or complex comprising a deglycosylation mutation comprises a glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), or aspartate (D) residue where a non-mutant (e.g., wild type or control) polypeptide or complex, respectively, comprises an asparagine (N) residue. An HCMV polypeptide, or complex-forming fragment thereof, or an HCMV complex of the present invention may comprise one or more deglycosylation mutations or may comprise a combination of one or more stabilizing mutations (i.e., a cavity-filling, repacking, and/or disulfide bridge mutation) and one or more deglycosylation mutations.
“Antigenicity” is used herein to refer to an antigen's ability to combine with an antibody, for example, to bind to a neutralizing antibody. An “increased antigencity” or “enhanced antigencity” therefore encompasses, for example, an increased binding affinity of a neutralizing antibody for the mutant antigen as compared to its binding affinity for a non-mutant antigen. An increased binding affinity may be provided as a decreased dissociation constant (Kd) value (in nM). See generally, e.g., Ma et al. Envelope Deglycosylation Enhances Antigenicity of HIV-1 gp41 Epitopes for Both Broad Neutralizing Antibodies and Their Unmutated Ancestor Antibodies, 2011 PLoS Path. 7(9), e1002200.
“Immunogenicity” is used herein to refer to an antigen's ability to induce an immune response. See generally, e.g., Ma et al., 2011 PLoS Path. 7(9), e1002200.
As used in the present disclosure and claims, the singular forms “a,” “an,” and “the” include plural forms unless the context clearly dictates otherwise; i.e., “a” means “one or more” unless indicated otherwise.
The terms “about” or “approximately” mean roughly, around, or in the regions of. The terms “about” or “approximately” further mean within an acceptable contextual error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e. the limitations of the measurement system or the degree of precision required for a particular purpose, e.g. the amount of a complex within media. When the terms “about” or “approximately” are used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. For example, “between about 5.5 to 6.5 mg/ml” means the boundaries of the numerical range extend below 5.5 and above 6.5 so that the particular value in question achieves the same functional result as within the range. For example, “about” and “approximately” can mean within 1 or more than 1 standard deviation as per the practice in the art. Alternatively, “about” and “approximately” can mean a range of up to 20%, preferably up to 10%, more preferably up to 5%, and more preferably up to 1% of a given value.
The term “and/or” as used in a phrase such as “A and/or B” is intended to include “A and B,” “A or B,” “A,” and “B.” Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
Unless specified otherwise, all of the designations “A %-B %,” “A-B %,” “A % to B %,” “A to B %,” “A %-B,” “A % to B” are given their ordinary and customary meaning. In some embodiments, these designations are synonyms.
The term “comprising” encompasses “including” as well as “consisting” e.g. a composition “comprising” X may consist exclusively of X or may include something additional e.g. X+Y. The terms “comprising” and “having” when used as a transition phrase herein are open-ended whereas the term “consisting of” when used as a transition phrase herein is closed (i.e., limited to that which is listed and nothing more).
The word “substantially” does not exclude “completely” e.g. a composition which is “substantially free” from Y may be completely free from Y. Where necessary, the word “substantially” may be omitted from the definition of the invention.
An “effective amount”, such as in “a therapeutically effective amount of an antigen and/or adjuvant”, means an amount sufficient to cause the referenced effect or outcome. An “effective amount” can be determined empirically and in a routine manner using known techniques in relation to the stated purpose.
Unless specifically stated, a process comprising a step of mixing two or more components does not require any specific order of mixing. Thus components can be mixed in any order. Where there are three components then two components can be combined with each other, and then the combination may be combined with the third component, etc. Similarly, while steps of a method may be numbered (such as (1), (2), (3), etc. or (i), (ii), (iii)), the numbering of the steps does not mean that the steps must be performed in that order (i.e., step 1 then step 2 then step 3, etc.). The word “then” may be used to specify the order of a method's steps.
“Recombinant” as used herein to describe a polynucleotide means a polynucleotide of genomic, cDNA, RNA (including mRNA) semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all or a portion of the polynucleotide with which it is associated in nature; and/or (2) is linked to a polynucleotide other than that to which it is linked in nature. The term “recombinant” as used with respect to a protein or polypeptide means a polypeptide produced by expression of a recombinant polynucleotide. When a nucleic acid molecule is operably linked to another polynucleotide that it is not associated with in nature, the nucleic acid molecule may be referred to as “heterologous” (i.e., the nucleic acid molecule is heterologous to at least the polynucleotide). Similarly, when a polypeptide is in contact with or in a complex with another protein that it is not associated with in nature, the polypeptide may be referred to as “heterologous” (i.e., the polypeptide is heterologous to the protein). Further, when a host cell comprises a nucleic acid molecule or polypeptide that it does not naturally comprise, the nucleic acid molecule and polypeptide may be referred to as “heterologous” (i.e., the nucleic acid molecule is heterologous to the host cell and the polypeptide is heterologous to the host cell).
Sequence identity between polypeptide sequences is preferably determined by pairwise alignment algorithm using the Needleman-Wunsch global alignment algorithm (Needleman and Wunsch, A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins, 1970 J. Mol. Biol. 48(3): 443-453), using default parameters (e.g. with Gap opening penalty=10.0, and with Gap extension penalty=0.5, using the EBLOSUM62 scoring matrix). This algorithm is conveniently implemented in the needle tool in the EMBOSS package (Rice et al., EMBOSS: The European Molecular Biology Open Software Suite, 2000 Trends Genetics 16: 276-277). Sequence identity should be calculated over the entire length of the polypeptide sequence of the invention.
HCMV glycoprotein H (gH), which is encoded by the UL75 gene, is a virion glycoprotein that is essential for infectivity and which is conserved among members of the alpha-, beta- and gamma-herpesviruses.
The gH from HCMV strain Merlin has been reported (NCBI GI:52139248 (which is also NCBI GenBank Accession No. YP_081523.1), SEQ ID NO: 1) to consist of 742 amino acids. The gH from HCMV strain Towne (NCBI GI:138314 which is also NCBI UniProtKB Accession No. P17176.1; SEQ ID NO: 5) also consists of 742 amino acids (SEQ ID NO: 5). The gH from HCMV strain AD169 is published as NCBI UniProtKB Accession No. P12824.1 (herein SEQ ID NO: 6). HCMV gH has been reported to have six N-glycosylation sites (at residues 55, 62, 67, 192, 641 and 700), and consists of a hydrophobic signal sequence at its N-terminus (amino acid residues 1-23 of SEQ ID NO: 1), an ectodomain (residues 24-717 of SEQ ID NO: 1) that projects out of the cell into the extracellular space, a hydrophobic TM domain (residues 718-736 of SEQ ID NO: 1) and a C-terminal cytoplasmic domain (residues 737-742 of SEQ ID NO: 1).
The ectodomain of gH corresponds to the portion of gH which lacks the hydrophobic TM. The location and length of the ectodomain can be predicted based on pairwise alignment of a given sequence to SEQ ID NO: 1, for example by aligning the amino acid sequence of a gH polypeptide of interest to SEQ ID NO: 1 and identifying the sequence that aligns to residues 24-717 of SEQ ID NO: 1. Similarly, the locations of the TM and C-terminal domains can be predicted by aligning the amino acid sequence of a gH polypeptide of interest to SEQ ID NO: 1 and identifying the sequences that align to residues 718-736 and 737-742 of SEQ ID NO: 1, respectively. Alternatively, the location and length of the ectodomain, the signal sequence and the TM domain can be predicted based on computational analysis of the hydrophobicity along the length of a given gH protein sequence. The signal sequence and the TM domain have the highest levels of hydrophobicity and these two regions flank the ectodomain, which is less hydrophobic. The absence of a TM domain means that the modified polypeptide cannot reside within a lipid bilayer. In some embodiments, the gH polypeptide lacks the full-length natural TM domain; in other embodiments, it can retain a portion of the natural TM domain, but not enough to let the protein reside in a lipid bilayer. Thus, the polypeptide can contain up to 10 amino acids (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 amino acids) of the natural gH TM domain.
Typically, the N-terminal signal sequence of gH polypeptides is cleaved by a host cell signal peptidase to produce mature gH proteins. In a preferred embodiment, the HCMV gH polypeptide mutated in accordance with the present invention lack an N-terminal signal sequence. An example of HCMV gH polypeptide lacking the N-terminal sequence is SEQ ID NO: 2. In a further preferred embodiment, the HCMV gH polypeptide mutated in accordance with the present invention lacks an N-terminal signal sequence, the transmembrane (TM) domain, and the C-terminal domain. Expression of the full-length UL75 gene sequence hinders purification of soluble complexes comprising gH. Rather, complexes comprising gH can be purified at high yield and purity by omitting at least a portion of the TM domain of gH. For example, constructs encoding just the N-terminal signal sequence and the ectodomain of gH (or a majority of the gH ectodomain), but not the TM domain can be used to express a form of gH which is easily purified (see, e.g., WO 2014/005959, also published as U.S. Pre-grant Pub. No. 2016-0159864). Said constructs may encode the majority (e.g. 70%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, 99.5% or more) of the ectodomain of gH, but none or only a small portion of the TM domain. gH polypeptides of the invention may include the whole of the gH ectodomain or a truncated form of the gH ectodomain (such as the gH polypeptide consisting of SEQ ID NOs: 3 or 4 which do not comprise residues 716 or 717 of the ectodomain of SEQ ID NO: 1 and also do not comprise either the TM or C-terminal domains). Said truncated forms of the ectodomain may lack between 1 and 20 amino acids (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 amino acid residues) at their N-termini and/or C-termini relative to a full-length HCMV gH protein. In alternative embodiments, the HCMV gH polypeptides mutated in accordance with the present invention lack the N-terminal signal sequence, the TM domain and the C-terminal domain. An example of a preferred HCMV gH polypeptide for use in the invention is SEQ ID NO: 3, which has a truncated ectodomain, lacks the transmembrane (TM) domain, and lacks the C-terminal cytoplasmic domain of gH sequence SEQ ID NO: 1. SEQ ID NO: 3 consists of amino acid residues 1-715 of SEQ ID NO: 1. An example of a preferred gH protein of the invention is SEQ ID NO: 4, which lacks the N-terminal signal sequence, has a truncated ectodomain, lacks the TM domain, and lacks the C-terminal domain of gH sequence SEQ ID NO: 1. SEQ ID NO: 4 consists of amino acid residues 24-715 of SEQ ID NO: 1.
As shown within the Examples, gH polypeptides are glycosylated (comprise glycans via N-linked glycosylation) at six asparagine residues: N55, N62, N67, N192, N641, and N700 numbered with respect to gH amino acid sequence SEQ ID NO: 1. An HCMV gH polypeptide, or complex-forming fragment thereof, for use in the invention may comprise a deglycosylation mutation at one or more of the asparagines located at residues 55, 62, 67, 192, 641, and 700 numbered with respect to gH sequence SEQ ID NO: 1. The deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), or aspartate (D). In particular, the deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), or alanine (A). Further in particular, the deglycosylation mutation may be glutamine (Q).
gH proteins of the invention may contain additional amino acid residues, such as N-terminal or C-terminal extensions. Such extensions may include one or more tags, which can facilitate detection (e.g. an epitope tag for detection by monoclonal antibodies) and/or purification (e.g. a polyhistidine-tag to allow purification on a nickel-chelating resin) of the gH protein. For example, gH proteins of the invention may comprise a truncated gH ectodomain fused to a C-terminal extension (see, e.g., WO 2014/005959, also published as U.S. Pre-grant Pub. No. 2016-0159864).
gH proteins of the invention can have various degrees of identity to SEQ ID NO: 1, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 1. gH proteins of the invention can have various degrees of identity to SEQ ID NO: 2, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 2. gH proteins of the invention can have various degrees of identity to SEQ ID NO: 3, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 3. gH proteins of the invention can have various degrees of identity to SEQ ID NO: 4, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 4. gH proteins of the invention can have various degrees of identity to SEQ ID NO: 5, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 5. gH proteins of the invention can have various degrees of identity to SEQ ID NO: 6, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 6. Preferred gH proteins or fragments thereof: (i) can dimerize with HCMV gL; (ii) form part of the trimeric gH/gL/gO complex; (iii) form part of the pentameric gH/gL/pUL128/pUL130/pUL131A complex; (iv) comprise at least one epitope from SEQ ID NO: 1, 2, 3, 4, 5, or 6 and/or (v) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion. An exemplary complex-forming fragment of gH comprises residues Arg1 through Leu125 of SEQ ID NO: 2 which, when in complex with full length gL (perhaps using the flexible C-terminus of gL as a linker), full length pUL128, full length pUL130, and full length pUL131A; may form a truncated HCMV pentamer complex that nonetheless maintains the five conformational epitope sites described in
The HCMV gH polypeptides, or complex-forming fragments thereof, of the invention comprise one or more stabilizing mutations, suitably, one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. Any of the gH polypeptides having the sequence as set forth in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4, or any gH polypeptide having a sequence at least 90% identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4 may suitably comprise any one or more stabilizing mutations as identified and defined herein. Accordingly, in some embodiments, the HCMV gH polypeptide having the sequence set forth in SEQ ID NO: 1, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV gH polypeptide having the sequence set forth in SEQ ID NO: 2, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gH polypeptide having the sequence set forth in SEQ ID NO: 3, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gH polypeptide having the sequence set forth in SEQ ID NO: 4, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof.
As further detailed in Example 3 and illustrated in
The identification of relevant amino acid residues, in the HCMV gH polypeptide, to mutate for cavity-filling and/or repacking and/or disulfide bridges may be performed, for example, by both visual inspection of the three-dimensional structure with the aid of molecular graphics softwares, or by using any appropriate in-silico mutagenesis method. For example, softwares, such as Molecular Operating Environment (MOE) (edited by Chemical Computing Group Inc.) allows for a systematic analysis of amino acid sequences to identify specific regions in the polypeptide or specific amino acids, the mutation of which is predicted to enhance thermo-stability.
Suitable non-limiting exemplary amino acid residues to mutate in HCMV gH polypeptides are provided in Table 1, using the sequence set forth as SEQ ID NO: 1 as a reference only. Mutations at similar positions in other HCMV gH polypeptides, such as for example, polypeptides having the sequence as set forth in SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4, or gH polypeptides originating from HCMV strains different from the Merlin strain, are also contemplated in the present invention. It is within the skilled person's abilities to determine such similar positions in other HCMV gH polypeptides. Comparable amino acid positions in a given HCMV polypeptide can be determined by aligning the amino acid sequences using readily available and well-known alignment and algorithms (such as BLAST or ClustalW2). The actual number of the amino acid position may have to be adjusted for other HCMV gH polypeptides depending on the actual sequence alignment.
Accordingly, in some embodiments, amino acids to mutate in HCMV gH polypeptide for cavity filling are A102, A372, A352, or L257 of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 102, 372, 352, and 257 of SEQ ID NO: 1 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
Suitable non-limiting exemplary amino acids to mutate in HCMV gH polypeptides for hydrophobic mutations are H252, K404, R255, E355, H480, S601, or R405, of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions 252, 404, 255, 355, 480, 601, and 405 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P).
Suitable non-limiting exemplary amino acids to mutate in HCMV gH polypeptides for hydrophilic mutations are G358 or H275 of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), glutamine (Q), arginine (R), and glutamic acid (E). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to G358 and H275 of SEQ ID NO: 1 with amino acids selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), glutamine (Q), arginine (R), and glutamic acid (E). Suitable non-limiting exemplary amino acids to mutate in HCMV gH polypeptides for polar mutations are G358 or H275 of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine N, and glutamine (Q). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to G358 and H275 of SEQ ID NO: 1 with amino acids selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), and glutamine (Q). Suitable non-limiting exemplary amino acids to mutate in HCMV gH polypeptides for charged mutations are G358 or H275 of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with the amino acid residue arginine (R) or glutamic acid (E). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to G358 and H275 of SEQ ID NO: 1 with the amino acid arginine (R) or glutamic acid (E).
Suitable non-limiting exemplary amino acids to mutate in HCMV gH polypeptides for disulphide bridge mutations are V109 or LIII of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Accordingly, in some embodiments, the HCMV polypeptides of the invention comprise at least a substitution of the residue corresponding to V109 of SEQ ID NO: 1 with a cysteine (C) or of the residue corresponding to L111 of SEQ ID NO: 1 with a cysteine (C), suitably, both.
Accordingly, in some embodiments, amino acids to mutate in HCMV gH polypeptides for deglycosylation are N55, N62, N67, N192, N641, and N700 of the sequence set forth in SEQ ID NO: 1, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 55, 62, 67, 192, 641, and 700 of SEQ ID NO: 1 with amino acids selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D).
HCMV glycoprotein L (gL), which is encoded by the UL115 gene is thought to be essential for viral replication. All known functional properties of gL are directly associated with its dimerization with gH. The gL/gH complex is required for the fusion of viral and plasma membranes leading to virus entry into the host cell. gL from HCMV strain Merlin (NCBI GI:39842115 (which is also NCBI GenBank Accession No. AAR31659.1), SEQ ID NO: 7) and HCMV strain Towne (NCBI GI:239909463 which is also NCBI GenBank Accession No. ACS32410.1; SEQ ID NO: 11 herein) have been reported to be 278 amino acids in length. gL from HCMV strain AD169 (NCBI GI:2506510 which is also NCBI UniProtKB Accession No. P16832.2; SEQ ID NO: 12 herein) has been reported to be 278 amino acids in length, include a signal sequence at its N-terminus (amino acid residues 1-35), have two N-glycosylation sites (at residues 74 and 114) and lack a TM domain (Rigoutsos et al., In silico pattern-based analysis of the human cytomegalovirus genome, 2003 J. of Virology 77: 4326-44). Sequencing of the full-length gL gene from 22 to 39 clinical isolates, as well as laboratory strains AD169, Towne and Toledo revealed less than 2% variation in the amino acid sequences among the isolates (Rasmussen et al., The Genes Encoding the gCIII Complex of Human Cytomegalovirus Exist in Highly Diverse Combinations in Clinical Isolates, 2002 J. of Virology 76: 10841-10888). Typically, the N-terminal signal sequence of gL proteins is cleaved by a host cell signal peptidase to produce mature gL polypeptides. The gL polypeptides for use in the invention may lack an N-terminal signal sequence. An example of HCMV gL polypeptide lacking the N-terminal sequence is SEQ ID NO: 8, which consists of amino acid residues 31-278 of SEQ ID NO: 7. An example of a preferred gL polypeptide for use in the invention is SEQ ID NO: 9 or 10, which comprise an LSG mutation at what is believed to be a protease recognition site, wherein said mutation reduces protease cleavage. An example of a preferred gL polypeptide for use in the invention is SEQ ID NO: 29 or 30, which comprise an IDG mutation at what is believed to be a protease recognition site, wherein said mutation reduces protease cleavage.
As shown within the Examples, gL polypeptides are glycosylated (comprise glycans via N-linked glycosylation) at one asparagine residue, N74 numbered with respect to gL amino acid sequence SEQ ID NO: 7. An HCMV gL polypeptide, or complex-forming fragment thereof, for use in the invention may comprise a deglycosylation mutation at residue 74 numbered with respect to gL sequence SEQ ID NO: 7. The deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), or aspartate (D). In particular, the deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), or alanine (A). Further in particular, the deglycosylation mutation may be glutamine (Q).
gL proteins of the invention can have various degrees of identity to SEQ ID NO: 7, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 7. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 8, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 8. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 9, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 9. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 10, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 10. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 11, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 11. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 12, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 12. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 29, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 29. gL proteins of the invention can have various degrees of identity to SEQ ID NO: 30, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 30. Preferred gL proteins or fragments thereof: (i) can dimerize with HCMV gH; (ii) form part of the trimeric gH/gL/gO complex; (iii) form part of the pentameric gH/gL/pUL128/pUL130/pUL131A complex; (iv) comprise at least one epitope from SEQ ID NOs: 7, 8, 9, 10, 11, 12, 29, or 30; and/or (v) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion. An exemplary complex-forming fragment of gL comprises residues Thr76 through Tyr169 of SEQ ID NO: 7 which, when in complex with full length pUL128, full length pUL130, and full length pUL131A (i.e., gH is not present); may form a truncated HCMV pentamer complex that nonetheless maintains the five conformational epitope sites described in
The HCMV gL polypeptides, or complex-forming fragments thereof, of the invention comprise one or more stabilizing mutations, suitably, one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. Any of the gL polypeptides having the sequence as set forth in SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 29, or SEQ ID NO: 30 or any gL polypeptide having a sequence at least 90% identical to SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 29, or SEQ ID NO: 30, or complex-forming fragments thereof, may suitably comprise any one or more stabilizing mutations as identified and defined herein. Accordingly, in some embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 7, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 8, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 9, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 10, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 29, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV gL polypeptide having the sequence set forth in SEQ ID NO: 30, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof.
As further detailed in Example 3 and illustrated in
The identification of relevant amino acid residues, in the HCMV gL polypeptide, to mutate for cavity-filling and/or repacking and/or disulfide bridges may be performed, for example, by both visual inspection of the three-dimensional structure with the aid of molecular graphics softwares, or by using any appropriate in-silico mutagenesis method. For example, softwares, such as Molecular Operating Environment (MOE) (edited by Chemical Computing Group Inc.) allows for a systematic analysis of amino acid sequences to identify specific regions in the polypeptide or specific amino acids, the mutation of which is predicted to enhance thermo-stability.
Suitable non-limiting exemplary amino acid residues to mutate in HCMV gL polypeptides are provided in Table 2, using the sequence set forth as SEQ ID NO: 7 as a reference only. Mutations at similar positions in other HCMV gL polypeptides, such as for example, polypeptides having the sequence as set forth in SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 29, or SEQ ID NO: 30, or gL polypeptides originating from HCMV strains different from the Merlin strain, are also contemplated in the present invention. It is within the skilled person's abilities to determine such similar positions in other HCMV gL polypeptides. Comparable amino acid positions in a given HCMV gL polypeptide can be determined by aligning the amino acid sequences using readily available and well-known alignment and algorithms (such as BLAST or ClustalW2). The actual number of the amino acid position may have to be adjusted for other HCMV gL polypeptides depending on the actual sequence alignment.
Accordingly, in some embodiments, an amino acid to mutate in HCMV gL polypeptides for deglycosylation is N74 of the sequence set forth in SEQ ID NO: 7, or at a corresponding position in HCMV gL polypeptides originating from different HCMV strains. Suitably, the mutation of this amino acid residue may consist of substituting it with an amino acid residue selected from the group consisting of glutamine (Q), serine ( ), threonine (T), alanine (A), glutamate ( ), and aspartate (D). Accordingly, in some embodiments, the HCMV gL polypeptides of the invention comprise at least an amino acid substitution at the position corresponding to 74 of SEQ ID NO: 7 with an amino acid selected from the group consisting of glutamine (Q), serine (5), threonine (T), alanine (A), glutamate (E), and aspartate
Accordingly, in some embodiments, amino acids to mutate in HCMV gL polypeptides for cavity filling are H177, G224, G140, G145, D146, G218, L119, C233 or P272 of the sequence set forth in SEQ ID NO: 7, or at a corresponding position in HCMV gL polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L). Accordingly, in some embodiments, the HCMV gL polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 177, 224, 140, 145, 146, 218, 119, 233 and 272 of SEQ ID NO: 7 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
Suitable non-limiting exemplary amino acids to mutate in HCMV gL polypeptides for hydrophobic mutations are H267, H236, H245, G161, or C233 of the sequence set forth in SEQ ID NO: 7, or at a corresponding position in HCMV gL polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V), and proline (P). Accordingly, in some embodiments, the HCMV gL polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 267, 236, 245, 161, and 233 of SEQ ID NO: 7 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P).
Suitable non-limiting exemplary amino acids to mutate in HCMV gL polypeptides for disulphide bridge mutations are G161, D163, G224, G218, R166, G140, R160, or A150 of the sequence set forth in SEQ ID NO: 7, or at a corresponding position in HCMV gL polypeptides originating from different HCMV strains. Accordingly, in some embodiments, the HCMV gL polypeptides of the invention comprise at least a substitution selected from the group consisting of: G161C, D163C, G224C, G218C, R166C, G140C, R160C, A150C, and combinations thereof.
pUL128 Polypeptide
The pUL128 (or simply “UL128”) from HCMV strain Merlin has been reported (NCBI GI:39842124 (which is also NCBI GenBank Accession No. AAR31668.1), SEQ ID NO: 13) to consist of 130 amino acids and to contain a one (1) nucleotide substitution causing premature termination. The pUL128 from HCMV strains Towne (NCBI GI:39841882 (which is also NCBI GenBank Accession No. AAR31451.1), SEQ ID NO: 15) and AD169 (NCBI GI:59803078 (which is also NCBI UniProtKB Accession No. P16837.2), SEQ ID NO: 16) have been reported to consist of 171 amino acids. Due to the premature termination of SEQ ID NOs: 13, 15 and 17 only share 75% identity over the full-length of SEQ ID NO: 13. pUL128 is predicted to have an N-terminal signal sequence, which is located at residues 1-27 of SEQ ID NO: 13, but it is predicted to lack a TM domain. Typically, the N-terminal signal sequence of pUL128 polypeptides is cleaved by a host cell signal peptidase to produce mature pUL128 polypeptides. The pUL128 polypeptides for use in the invention may lack an N-terminal signal sequence. An example of HCMV pUL128 polypeptide lacking the N-terminal sequence is SEQ ID NO: 14, which consists of amino acid residues 28-130 of SEQ ID NO: 13.
pUL128 proteins of the invention can have various degrees of identity to SEQ ID NO: 13, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 13. pUL128 proteins of the invention can have various degrees of identity to SEQ ID NO: 14, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 14. pUL128 proteins of the invention can have various degrees of identity to SEQ ID NO: 15, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 15. pUL128 proteins of the invention can have various degrees of identity to SEQ ID NO: 16, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 16. Preferred pUL128 proteins or fragments thereof: (i) can form part of the pentameric gH/gL/pUL128/pUL130/pUL131A complex, (ii) comprise at least one epitope of SEQ ID NOs: 13, 14, 15, or 16, and/or (iii) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion.
The HCMV pUL128 polypeptides, or complex-forming fragments thereof, of the invention comprise one or more stabilizing mutations, suitably, one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. Any of the pUL128 polypeptides having the sequence as set forth in SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16, or any pUL128 polypeptide having a sequence at least 90% identical to SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or SEQ ID NO: 16, or complex-forming fragments thereof, may suitably comprise any one or more stabilizing mutations as identified and defined herein. Accordingly, in some embodiments, the HCMV pUL128 polypeptide having the sequence set forth in SEQ ID NO: 13, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL128 polypeptide having the sequence set forth in SEQ ID NO: 14, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV pUL128 polypeptide having the sequence set forth in SEQ ID NO: 15, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof. In further alternative embodiments, the HCMV pUL128 polypeptide having the sequence set forth in SEQ ID NO: 16, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulphide bridge mutations, or any combination of one or more thereof.
As further detailed in Example 3 and illustrated in
The identification of relevant amino acid residues, in the HCMV pUL128 polypeptide, to mutate for cavity-filling and/or repacking and/or disulfide bridges may be performed, for example, by both visual inspection of the three-dimensional structure with the aid of molecular graphics softwares, or by using any appropriate in-silico mutagenesis method. Such methods are known to the skilled person. For example, softwares, such as Molecular Operating Environment (MOE) (edited by Chemical Computing Group Inc.) allows for a systematic analysis of amino acid sequences to identify specific regions in the polypeptide or specific amino acids, the mutation of which is predicted to enhance thermo-stability.
Suitable non-limiting exemplary amino acid residues to mutate in HCMV pUL128 polypeptides are provided in Table 3, using the sequence set forth as SEQ ID NO: 13 as a reference only. Mutations at similar positions in other HCMV pUL128 polypeptides, such as for example, polypeptides having the sequence as set forth in SEQ ID NO: 14, or gL polypeptides originating from HCMV strains different from the Merlin strain, such as SEQ ID NO: 15 or SEQ ID NO: 16, are also contemplated in the present invention. It is within the skilled person's abilities to determine such similar positions in other HCMV pUL128 polypeptides. Comparable amino acid positions in a given HCMV pUL128 polypeptide can be determined by aligning the amino acid sequences using readily available and well-known alignment and algorithms (such as BLAST or ClustalW2). The actual number of the amino acid position may have to be adjusted for other HCMV pUL128 polypeptides depending on the actual sequence alignment.
Accordingly, in some embodiments, amino acids to mutate in HCMV pUL128 polypeptides for cavity filling are G123, V77, L103 or Q119 of the sequence set forth in SEQ ID NO: 13, or at a corresponding position in HCMV pUL128 polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L). Accordingly, in some embodiments, the HCMV pUL128 polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 123, 77, 103, and 119 of SEQ ID NO: 13 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL128 polypeptides for hydrophobic mutations are G145, H90, or G112 of the sequence set forth in SEQ ID NO: 13, or at a corresponding position in HCMV pUL128 polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P). Accordingly, in some embodiments, the HCMV pUL128 polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 145, 90, and 112 of SEQ ID NO: 13 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL128 polypeptides for disulphide bridge mutations are R142, N99, Y98, A124, G126, L159, D45, V88, M48, G107, R51, D106, or S83 of the sequence set forth in SEQ ID NO: 13, or at a corresponding position in HCMV pUL128 polypeptides originating from different HCMV strains. Accordingly, in some embodiments, the HCMV pUL128 polypeptides of the invention comprise at least a substitution selected from the group consisting of: R142C, N99C, Y98C, A124C, G126C, L159C, D45C, V88C, M48C, G107C, R51C, D106C, S83C, and combinations thereof.
pUL130 Polypeptide
pUL130 (or simply “UL130”) is the central and the largest (214 codons) gene of the HCMV UL131A-128 locus. The sequence of pUL130 from HCMV strain Merlin is publically available (NCBI GI: 39842125 (which is also NCBI GenBank Accession No. AAR31669.1) and SEQ ID NO: 17 herein. The sequence of pUL130 from HCMV strain Towne is publically available (NCBI GI:239909473 (which is also NCBI ACS32420.1) and SEQ ID NO: 19 herein). Likewise, the sequence of pUL130 from HCMV strain AD169 is publically available (NCBI UniprotKB Accession No. P16772 and SEQ ID NIO: 20 herein). The Merlin and Towne pUL130 sequences consist of 214 and 229 amino acids, respectively. Merlin pUL130 sequence SEQ ID NO: 17 comprises a 25 amino acid long N-terminal signal sequence at residues 1-25 that precedes a hydrophilic protein containing two potential N-linked glycosylation sites (Asn85 and Asn118) within a putative chemokine domain (amino acids 46 to 120) and an additional N-glycosylation site (Asn201) close to the end of a unique C-terminal region. pUL130 is predicted to lack a TM domain.
Typically, the N-terminal signal sequence of pUL130 polypeptides is cleaved by a host cell signal peptidase to produce mature pUL130 proteins. The pUL130 polypeptides for use in the invention may lack an N-terminal signal sequence. An example of HCMV pUL130 polypeptide lacking the N-terminal sequence is SEQ ID NO: 18, which consists of amino acid residues 26-214 of SEQ ID NO: 17.
As shown within the Examples, pUL130 polypeptides are glycosylated (comprise glycans via N-linked glycosylation) at three asparagine residues: N85, N118, and N201 numbered with respect to pUL130 amino acid sequence SEQ ID NO: 17. An HCMV pUL130 polypeptide, or complex-forming fragment thereof, for use in the invention may comprise a deglycosylation mutation at one or more of the asparagines located at residues 85, 118, and 201 numbered with respect to pUL130 sequence SEQ ID NO: 17. The deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), or aspartate (D). In particular, the deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), or alanine (A). Further in particular, the deglycosylation mutation may be glutamine (Q).
pUL130 proteins of the invention can have various degrees of identity to SEQ ID NO: 17, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 17. pUL130 proteins of the invention can have various degrees of identity to SEQ ID NO: 18, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 18. pUL130 proteins of the invention can have various degrees of identity to SEQ ID NO: 19, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 19. pUL130 proteins of the invention can have various degrees of identity to SEQ ID NO: 20, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 20. Preferred pUL130 proteins or fragments thereof: (i) can form a pentameric gH/gL/pUL128/pUL130/pUL131A complex; (ii) comprise at least one epitope of SEQ ID NO: 17, 18, 19, or 20; and/or (iii) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion. An exemplary complex-forming fragment of pUL130 comprises residues Thr45 through Val214 of SEQ ID NO: 17 which, when in complex with full length full length pUL131A (i.e., none of gH, gL, or pUL128 are present); may form a truncated HCMV pentamer complex that nonetheless maintains the five conformational epitope sites described in
The HCMV pUL130 polypeptides, or complex-forming fragments thereof, of the invention comprise one or more stabilizing mutations, suitably, one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. Any of the pUL130 polypeptides having the sequence as set forth in SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID NO: 20, or any pUL130 polypeptide having a sequence at least 90% identical to SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID NO: 20, or complex-forming fragments thereof, may suitably comprise any one or more stabilizing mutations as identified and defined herein. Accordingly, in some embodiments, the HCMV pUL130 polypeptide having the sequence set forth in SEQ ID NO: 17, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL130 polypeptide having the sequence set forth in SEQ ID NO: 18, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL130 polypeptide having the sequence set forth in SEQ ID NO: 19, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL130 polypeptide having the sequence set forth in SEQ ID NO: 20, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof.
As further detailed in Example 3 and illustrated in
The identification of relevant amino acid residues, in the HCMV pUL130 polypeptide, to mutate for cavity-filling and/or repacking and/or disulfide bridges may be performed, for example, by both visual inspection of the three-dimensional structure with the aid of molecular graphics softwares, or by using any appropriate in-silico mutagenesis method. Such methods are known to the skilled person. For example, softwares, such as Molecular Operating Environment (MOE) (edited by Chemical Computing Group Inc.) allows for a systematic analysis of amino acid sequences to identify specific regions in the polypeptide or specific amino acids, the mutation of which is predicted to enhance thermo-stability.
Suitable non-limiting exemplary amino acid residues to mutate in HCMV pUL130 polypeptides are provided in Table 4, using the sequence set forth as SEQ ID NO: 17 as a reference only. Mutations at similar positions in other HCMV pUL130 polypeptides, such as for example, polypeptides having the sequence as set forth in SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, or pUL130 polypeptides originating from HCMV strains different from the Merlin strain, are also contemplated in the present invention. It is within the skilled person's abilities to determine such similar positions in other HCMV pUL130 polypeptides. Comparable amino acid positions in a given HCMV pUL130 polypeptide can be determined by aligning the amino acid sequences using readily available and well-known alignment and algorithms (such as BLAST or ClustalW2). The actual number of the amino acid position may have to be adjusted for other HCMV pUL130 polypeptides depending on the actual sequence alignment.
Accordingly, in some embodiments, amino acids to mutate in HCMV pUL130 polypeptides for cavity filling are D165 or H209 of the sequence set forth in SEQ ID NO: 17, or at a corresponding position in HCMV pUL130 polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L). Accordingly, in some embodiments, the HCMV pUL130 polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 165 and 209 of SEQ ID NO: 17 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL130 polypeptides for hydrophobic mutations are G116, G135, H150, or H209 of the sequence set forth in SEQ ID NO: 17, or at a corresponding position in HCMV pUL130 polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P). Accordingly, in some embodiments, the HCMV pUL130 polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to G116, G135, H150, and H209 of SEQ ID NO: 17 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL130 polypeptides for disulphide bridge mutations are G116, H150, P64, S178, P62, E95, Y204, N211, I213, Y56, or T167 of the sequence set forth in SEQ ID NO: 17, or at a corresponding position in HCMV pUL130 polypeptides originating from different HCMV strains. Accordingly, in some embodiments, the HCMV pUL130 polypeptides of the invention comprise at least a substitution selected from the group consisting of: G116C, H150C, P64C, S178C, P62C, E95C, Y204C, N211C, I213C, Y56C, T167C, and combinations thereof.
Accordingly, in some embodiments, amino acids to mutate in HCMV pUL130 polypeptides for deglycosylation are N85, N118, and N201 of the sequence set forth in SEQ ID NO: 17, or at a corresponding position in HCMV pUL130 polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D). Accordingly, in some embodiments, the HCMV pUL130 polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 85, 118, and 201 of SEQ ID NO: 17 with amino acids selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D).
pUL131A Polypeptide
pUL131A function is required for HCMV replication not only in endothelial cells but also in epithelial cells. The pUL131A from HCMV strains Merlin (NCBI GI:39842126 (which is also NCBI GenBank Accession No. AAR31670.1), SEQ ID NO: 21) and Towne (NCBI GI:239909474 (which is also NCBI GenBank Accession No. ACS32421.1), SEQ ID NO: 23) and AD169 (NCBI GI:219879712 (which is also NCBI GenBank Accession No. DAA06452.1), SEQ ID NO: 24) have been reported to consist of 129, 129 and 76 amino acids, respectively. pUL131A contains an N-terminal signal sequence, which is located at residues 1-18 of SEQ ID NO: 21, and lacks a TM domain. The UL131A from strain AD169 has been reported to contain a one (1) base-pair insertion, which causes a frame-shift (Wang and Shenk, Human Cytomegalovirus UL131 Open Reading Frame Is Required for Epithelial Cell Tropism, 2005 J. Virol. 79: 10330-10338). SEQ ID NO: 21 is 96% identical to SEQ ID NO: 24 over the N-terminal 28 amino acids, but it is only 36% identical to SEQ ID NO: 24 over the full-length of SEQ ID NO: 21 due to the frame-shift in the AD169 UL131A gene.
Typically, the N-terminal signal sequence of pUL131A polypeptides is cleaved by a host cell signal peptidase to produce mature pUL131A proteins. The pUL131A polypeptides for use in the invention may lack an N-terminal signal sequence. An example of HCMV pUL131A polypeptide lacking the N-terminal sequence is SEQ ID NO: 22, which consists of amino acid residues 19-129 of SEQ ID NO: 21.
As shown within the Examples, pUL131A polypeptides are glycosylated (comprise glycans via N-linked glycosylation) at asparagine residue N81 numbered with respect to gH amino acid sequence SEQ ID NO: 21. An HCMV pUL131A polypeptide, or complex-forming fragment thereof, for use in the invention may comprise a deglycosylation mutation at the asparagine located at residue 81 numbered with respect to pUL131A sequence SEQ ID NO: 21. The deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), or aspartate (D). In particular, the deglycosylation mutation may be glutamine (Q), serine (S), threonine (T), or alanine (A). Further in particular, the deglycosylation mutation may be glutamine (Q).
pUL131A proteins of the invention can have various degrees of identity to SEQ ID NO: 21, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 21. pUL131A proteins of the invention can have various degrees of identity to SEQ ID NO: 22, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 22. pUL131A proteins of the invention can have various degrees of identity to SEQ ID NO: 23, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 23. pUL131A proteins of the invention can have various degrees of identity to SEQ ID NO: 24, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 24. Preferred pUL131A proteins: (i) can form pentameric gH/gL/pUL128/pUL130/pUL131A complexes, (ii) comprise at least one epitope of SEQ ID NO: 21, 22, 23, or 24; and/or (iii) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion. An exemplary complex-forming fragment of pUL131A comprises residues Gln19 through Asn129 of SEQ ID NO: 21 which, when in complex with full length pUL130 (i.e., none of gH, gL, or pUL128 are present); may form a truncated HCMV pentamer complex that nonetheless maintains the five conformational epitope sites described in
The HCMV pUL131A polypeptides, or complex-forming fragments thereof, of the invention comprise one or more stabilizing mutations, suitably, one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. Any of the pUL131A polypeptides having the sequence as set forth in SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24, or any pUL131A polypeptide having a sequence at least 90% identical to SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24, or complex-forming fragments thereof, may suitably comprise any one or more stabilizing mutations as identified and defined herein. Accordingly, in some embodiments, the HCMV pUL131A polypeptide having the sequence set forth in SEQ ID NO: 21, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL131A polypeptide having the sequence set forth in SEQ ID NO: 22, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL131A polypeptide having the sequence set forth in SEQ ID NO: 23, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof. In alternative embodiments, the HCMV pUL131A polypeptide having the sequence set forth in SEQ ID NO: 24, or complex-forming fragments thereof, comprises one or more cavity-filling mutations, one or more hydrophobic mutations, one or more hydrophilic mutations, one or more disulfide bridge mutations, or any combination of one or more thereof.
As further detailed in Example 3 and illustrated in
The identification of relevant amino acid residues, in the HCMV pUL131A polypeptide, to mutate for cavity-filling and/or repacking and/or disulfide bridges may be performed, for example, by both visual inspection of the three-dimensional structure with the aid of molecular graphics softwares, or by using any appropriate in-silico mutagenesis method. Such methods are known to the skilled person. For example, softwares, such as Molecular Operating Environment (MOE) (edited by Chemical Computing Group Inc.) allows for a systematic analysis of amino acid sequences to identify specific regions in the polypeptide or specific amino acids, the mutation of which is predicted to enhance thermo-stability.
Suitable non-limiting exemplary amino acid residues to mutate in HCMV pUL131A polypeptides are provided in Table 5, using the sequence set forth as SEQ ID NO: 21 as a reference only. Mutations at similar positions in other HCMV pUL131A polypeptides, such as for example, polypeptides having the sequence as set forth in SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NIO: 24, or pUL131A polypeptides originating from HCMV strains different from the Merlin strain, are also contemplated in the present invention. It is within the skilled person's abilities to determine such similar positions in other HCMV pUL131A polypeptides. Comparable amino acid positions in a given HCMV pUL131A polypeptide can be determined by aligning the amino acid sequences using readily available and well-known alignment and algorithms (such as BLAST or ClustalW2). The actual number of the amino acid position may have to be adjusted for other HCMV pUL130 polypeptides depending on the actual sequence alignment.
Accordingly, in some embodiments, amino acids to mutate in HCMV pUL131A polypeptides for cavity filling are G99, S86 or S90 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV pUL131A polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L). Accordingly, in some embodiments, the HCMV pUL131A polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 99, 86 or 90 of SEQ ID NO: 21 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), tyrosine (Y), and leucine (L).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL131A polypeptides for hydrophobic mutations are H69, H35, H64, D38, V85, Y52, or A67 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV pUL131A polypeptides originating from different HCMV strains. Suitably, the mutation of these amino acid residues may consist of substituting any of them and/or more than one of them, possibly all of them, with amino acid residues, selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P). Accordingly, in some embodiments, the HCMV pUL131A polypeptides of the invention comprise at least an amino acid substitution at positions corresponding to 69, 35, 64, 38, 85, 52, and 67 of SEQ ID NO: 21 with amino acids selected from the group consisting of tryptophan (W), phenylalanine (F), methionine (M), cysteine (C), alanine (A), leucine (L), isoleucine (I), valine (V) and proline (P).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL131A polypeptides for hydrophilic mutations is R118 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of this amino acid residue may consist of substituting it for an amino acid selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), glutamine (Q), arginine (R), and glutamic acid (E). Accordingly, in some embodiments, the HCMV pUL131A polypeptides of the invention comprise at least an amino acid substitution at the position corresponding to R118 of SEQ ID NO: 21 with an amino acid selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N) glutamine (Q), arginine (R), and glutamic acid (E). Suitable non-limiting exemplary amino acids to mutate in HCMV pUL131A polypeptides for polar mutations is R118 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of this amino acid residue may consist of substituting it for an amino acid selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), and glutamine (Q). Accordingly, in some embodiments, the HCMV pUL131A polypeptides of the invention comprise at least an amino acid substitution at position R118 with an amino acid selected from the group consisting of serine (S), threonine (T), cysteine (C), tyrosine (Y), asparagine (N), and glutamine (Q). Suitable non-limiting exemplary amino acids to mutate in HCMV pUL131A polypeptides for charged mutations is R118 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of this amino acid residue may consist of substituting it for arginine (R) or glutamic acid (E). Accordingly, in some embodiments, the HCMV gH polypeptides of the invention comprise at least an amino acid substitution at the position corresponding to R118 of SEQ ID NO: 21 with the amino acid arginine (R) or glutamic acid (E).
Suitable non-limiting exemplary amino acids to mutate in HCMV pUL131A polypeptides for disulphide bridge mutations are H64 or W37 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Accordingly, in some embodiments, the HCMV polypeptides of the invention comprise at least a substitution of the residue corresponding to H64 of SEQ ID NO: 21 with a cysteine (C) or of the residue corresponding to W37 of SEQ ID NO: 21 with a cysteine (C), suitably, both.
Accordingly, in some embodiments, an amino acid to mutate in HCMV pUL131A polypeptides for deglycosylation is N81 of the sequence set forth in SEQ ID NO: 21, or at a corresponding position in HCMV gH polypeptides originating from different HCMV strains. Suitably, the mutation of this amino acid residue may consist of substituting it with an amino acid residue selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D). Accordingly, in some embodiments, the HCMV pUL131A polypeptides of the invention comprise at least an amino acid substitution at position 81 of SEQ ID NO: 21 with an amino acid selected from the group consisting of glutamine (Q), serine (S), threonine (T), alanine (A), glutamate (E), and aspartate (D).
HCMV glycoprotein O (gO), which is encoded by the UL74 gene, has been reported to act as a molecular chaperone, increasing gH/gL ER export and incorporation into virions. It has been proposed that gO competes with pUL128-131A for binding onto gH/gL but is released from gH/gL, so that gH/gL (lacking pUL128-131A) is incorporated into virions (Ryckman et al., Human Cytomegalovirus TR Strain Glycoprotein O Acts as a Chaperone Promoting gH/gL Incorporation into Virions but Is Not Present in Virions, 2010 Journal of Virology 84: 2597-2609). Compared with other viral genes, HCMV gO is unusually variable among different HCMV strains: the variability of the gO amino acid sequence among 22 to 39 clinical isolates, as well as laboratory strains AD169, Towne and Toledo approached 45% (i.e. there was only 55% identity between the gO amino acid sequences between different isolates) (Rasmussen, et al., The Genes Encoding the gCIII Complex of Human Cytomegalovirus Exist in Highly Diverse Combinations in Clinical Isolates, 2002 J. Virol. 76: 10841-10888). The gO from HCMV strains Merlin (NCBI GI:39842082 (which is also NCBI GenBank Accession No. AAR31626.1), SEQ ID NO: 25 herein), AD169 (NCBI GI:136968 (which is also NCBI UniProtKB Accession No. P16750.1), SEQ ID NO: 28 herein) and Towne (NCBI GI:239909431 (which is also NCBI GenBank Accession No. ACS32378.1), SEQ ID NO: 27 herein) have been reported to consist of 472, 466 and 457 amino acids, respectively. The gO of HCMV strain AD169, which shares a 73% amino acid similarity to SEQ ID NO: 25, has 18 N-glycosylation sites (at residues 75, 83, 87, 103, 130, 157, 162, 171, 219, 242, 288, 292, 350, 385, 392, 399, 433 and 454), and may include a cleavable signal sequence at its N-terminus (predicted to consist of amino acid residues 1-30), which is absent from the mature polypeptide. Rigoutsos et al. (In silico pattern-based analysis of the human cytomegalovirus genome, 2003 J. Virol. 77: 4326-4344) predicted the presence of TM domains (in regions 10-28 and 190-212) and a coiled coil region (residues 240-272).
Typically, the N-terminal signal sequence of gO proteins is cleaved by a host cell signal peptidase to produce mature gO proteins. The gO proteins in HCMV membrane complexes of the invention may lack an N-terminal signal sequences. An example of a preferred gO protein of the invention is SEQ ID NO: 26, which lacks an N-terminal signal sequence and consists of amino acid residues 31-472 of SEQ ID NO: 25.
gO proteins of the invention can have various degrees of identity to SEQ ID NO: 25, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 25. gO proteins of the invention can have various degrees of identity to SEQ ID NO: 26, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 26. gO proteins of the invention can have various degrees of identity to SEQ ID NO: 27, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 27. gO proteins of the invention can have various degrees of identity to SEQ ID NO: 28, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 28. gO nucleotide sequences of the invention can have various degrees of identity to SEQ ID NO: 39, such as an at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence recited in SEQ ID NO: 39. Preferred gO proteins or fragments thereof: (i) can form part of the trimeric gH/gL/gO complex; (ii) cannot form part of the pentameric gH/gL/pUL128/pUL130/pUL131A complex, (iii) comprise at least one epitope of SEQ ID NO: 25, 26, 27, or 28; and/or (iv) can elicit antibodies in vivo which immunologically cross-react with an HCMV virion.
In another aspect, the invention provides a pentameric complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more stabilizing mutations, (ii) any of the above HCMV gL polypeptide comprising one or more stabilizing mutations, (iii) any of the above HCMV pUL128 polypeptide comprising one or more stabilizing mutations, (iv) any of the above HCMV pUL130 polypeptide comprising one or more stabilizing mutations, and (v) any of the above HCMV pUL131A polypeptide comprising one or more stabilizing mutations. Such complexes may further include, e.g., (vi) any of the above HCMV gH, gL, pUL130, and pUL131A deglycosylation mutations.
In another aspect, the invention provides a gH/gL complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more stabilizing mutations, and/or (ii) any of the above HCMV gL polypeptide comprising one or more stabilizing mutations. Such complexes may further include, e.g., (iii) any of the above HCMV gH and gL deglycosylation mutations.
In another aspect, the invention provides a gH/gL/gO complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more stabilizing mutations and/or (ii) any of the above HCMV gL polypeptide comprising one or more stabilizing mutations. Such complexes may further include, e.g., (iii) any of the above HCMV gH and gL deglycosylation mutations.
In another aspect, the invention provides a pentameric complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more deglycosylation mutations, (ii) any of the above HCMV gL polypeptide comprising a deglycosylation mutation, (iii) any of the above HCMV pUL130 polypeptide comprising one or more deglycosylation mutations, and (iv) any of the above HCMV pUL131A polypeptide comprising a deglycosylation mutation. Such complexes may further include, e.g., (vi) any of the above HCMV gH, gL, pUL130, and pUL131A stabilization mutations.
In another aspect, the invention provides a gH/gL complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more deglycosylation mutations, and/or (ii) any of the above HCMV gL polypeptide comprising a deglycosylation mutation. Such complexes may further include, e.g., (iii) any of the above HCMV gH and gL stabilization mutations.
In another aspect, the invention provides a gH/gL/gO complex comprising the mutated HCMV polypeptides, or complex-forming fragments thereof, described herein. Such complexes include, e.g. (i) any of the above HCMV gH polypeptide comprising one or more deglycosylation mutations, and/or (ii) any of the above HCMV gL polypeptide comprising a deglycosylation mutation. Such complexes may further include, e.g., (iii) any of the above HCMV gH and gL stabilization mutations.
In another aspect, the invention provides a composition comprising a mutant polypeptide, or a mutant complex-forming fragment thereof, wherein the mutant polypeptide or mutant fragment is at least one of HCMV gH, gL, pUL128, pUL130, and pUL131A and wherein the mutant polypeptide or mutant fragment comprises at least one stabilizing mutation. In another aspect, the invention provides a composition comprising a mutant polypeptide, or a mutant complex-forming fragment thereof, wherein the mutant polypeptide or mutant fragment is at least one of HCMV gH, gL, pUL130, and pUL131A and wherein the mutant polypeptide or mutant fragment comprises at least one deglycosylation mutation. In another aspect, the invention provides a composition comprising a complex having at least one mutant polypeptide, or a mutant complex-forming fragment thereof, wherein the mutant polypeptide or mutant fragment is at least one of HCMV gH, gL, pUL128, pUL130, and pUL131A and wherein the mutant polypeptide or mutant fragment comprises at least one stabilizing mutation. In another aspect, the invention provides a composition comprising a complex having at least one mutant polypeptide, or a mutant complex-forming fragment thereof, wherein the mutant polypeptide or mutant fragment is at least one of HCMV gH, gL, pUL130, and pUL131A and wherein the mutant polypeptide or mutant fragment comprises at least one deglycosylation mutation. The complexes of the present invention include gH/gL, gH/gL/gO, and pentameric complexes. The composition of the present invention may be an immunogenic composition. The composition of the present invention may be a vaccine composition. Such compositions can be used to raise antibodies in a mammal (e.g. a human, murine, guinea pig, or macaque).
The invention provides pharmaceutical compositions comprising a stabilized HCMV complex (e.g., pentamer complex) as described herein. Similarly, the invention provides processes for making a pharmaceutical composition involving combining a stabilized HCMV complex (e.g., pentamer complex) of the invention with a pharmaceutically acceptable carrier.
In addition to their antigens, immunogenic and pharmaceutical compositions of the invention typically include a “non-antigen component” which as used with respect to the present invention may be an adjuvant or a pharmaceutically acceptable carrier. A thorough discussion of such carriers is available in Remington: The Science and Practice of Pharmacy. Pharmaceutically acceptable carriers include any carrier that does not itself induce the production of neutralizing antibodies. Suitable carriers are typically large, slowly metabolised macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, sucrose, trehalose, lactose, and lipid aggregates (such as oil droplets or liposomes). Sterile pyrogen-free, phosphate-buffered physiologic saline is a typical carrier. Such carriers are well known to those of ordinary skill in the art. The non-antigen component may be a diluent, such as water, saline, glycerol, etc. Additionally, a non-antigen component may be auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and the like. Non-antigen components of the invention include an antimicrobial, particularly when packaged in multiple dose format. Non-antigen components of the invention include detergents, e.g., a TWEENT™ (polysorbate), such as TWEEN80™. Detergents are generally present at low levels e.g. <0.01%. Non-antigen components of the invention include sodium salts (e.g., sodium chloride) to give tonicity. A concentration of 1.0±2 mg/ml NaCl is typical. Non-antigen components of the invention include a buffer, such as a phosphate buffer. Non-antigen components of the invention include a sugar alcohol (e.g. mannitol) or a disaccharide (e.g., sucrose or trehalose), e.g., at around 15-30 mg/ml (e.g. 25 mg/ml).
The pH of the composition is usually between 6 and 8, and more preferably between 6.5 and 7.5 (e.g. about 7). Stable pH may be maintained by the use of a buffer e.g. a Tris buffer, a citrate buffer, phosphate buffer, or a histidine buffer. Thus, a composition will generally include a buffer. A composition may be sterile and/or pyrogen-free. Compositions may be isotonic with respect to humans. A composition comprises an immunologically effective amount of the referenced antigen(s). An ‘immunologically effective amount’ is an amount which, when administered to a subject, is effective for eliciting an antibody response against the antigen. This amount can vary depending upon the health and physical condition of the individual to be treated, their age, the capacity of the individual's immune system to synthesize antibodies, the degree of protection desired, the formulation of the vaccine, the treating doctor's assessment of the medical situation, and other relevant factors. It is expected that the amount will fall in a relatively broad range that can be determined through routine trials. The antigen content of compositions of the invention will generally be expressed in terms of the mass of protein per dose. A dose of 10-500 μg (e.g. 50 μg) per antigen can be useful. Immunogenic compositions may include an immunological adjuvant.
Compositions may include an antimicrobial, particularly when packaged in multiple dose format. Antimicrobials such as thiomersal and 2-phenoxyethanol are commonly found in vaccines, but it is preferred to use either a mercury-free preservative or no preservative at all. Compositions may comprise detergent e.g. a polysorbate, such as polysorbate 80. Detergents are generally present at low levels e.g. <0.01%. Compositions may include sodium salts (e.g. sodium chloride) to give tonicity. A concentration of 10+2 mg/ml NaCl is typical e.g. about 9 mg/ml.
Compositions of the invention will generally be administered directly to a subject. Direct delivery may be accomplished by parenteral injection (e.g. subcutaneously, intraperitoneally, intravenously, intramuscularly, or to the interstitial space of a tissue), or by any other suitable route. Intramuscular administration is preferred e.g. to the thigh or the upper arm. Injection may be via a needle (e.g. a hypodermic needle), but needle-free injection may alternatively be used.
Vaccines of the invention may be prophylactic (i.e. to prevent disease) or therapeutic (i.e. to reduce or eliminate the symptoms of a disease).
In one aspect, the invention provides a process for expressing the mutant HCMV polypeptide(s) of the invention. Suitable expression systems for use in the present invention are described in detail in, for example, Doyle (High Throughput Protein Expression and Purification: Methods and Protocols in M
Examples of suitable expression systems include, for example, chromosomal, episomal and virus-derived systems, including, for example, vectors derived from: bacterial plasmids, bacteriophage, transposons, yeast episomes, insertion elements, yeast chromosomal elements, viruses such as baculoviruses, papovaviruses such as SV40, vaccinia viruses, adenoviruses, fowl poxviruses, pseudorabies viruses and retroviruses, or combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, including cosmids and phagemids. Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of DNA than can be contained and expressed in a plasmid. Suitable expression systems include microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected or transfected with virus expression vectors (for example, baculovirus); plant cell systems transformed with virus expression vectors (for example, cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (for example, Ti or pBR322 plasmids); or animal cell systems. Cell-free translation systems can also be employed to produce the proteins of the invention. Preferably, the proteins of the invention are produced in eukaryotic cells, such as mammalian cells.
Recombinant polypeptides may be expressed transiently or stably. Preferably, the recombinant proteins are expressed stably. For example, cell lines that stably express the peptide of interest may be transfected using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells that successfully express the introduced sequences. Resistant clones of stably transformed cells may be proliferated using tissue culture techniques appropriate to the cell type.
Mammalian cell lines available as hosts (or host cells) for expression are known in the art and include many immortalised cell lines available from the American Type Culture Collection (ATCC) including, but not limited to, Chinese hamster ovary (CHO), HeLa, baby hamster kidney (BHK), monkey kidney (COS), C127, 3T3, BHK, human embryonic kidney (HEK) 293, Bowes melanoma and human hepatocellular carcinoma (for example Hep G2) cells and a number of other cell lines. Expression in mammalian cells is preferable because the proteins that are produced will have authentic mammalian glycosylation patterns, and thus possess epitopes that are present on infectious HCMV particles. Accordingly, production of membrane protein complexes of the invention in mammalian cells will lead to the production of antibodies that are able to bind to naturally occurring HCMV particles during infection.
In the baculovirus system, the materials for baculovirus/insect cell expression systems are commercially available in kit form from, inter alia, Invitrogen, San Diego Calif. (the “MaxBac” kit). These techniques are described fully in Summers et al. (Summers and Smith, A manual of methods for baculovirus vectors and insect cell culture procedures, Texas Agricultural Experiment Station Bulletin No. 1555, 1987). Particularly suitable host cells for use in this system include insect cells such as Drosophila S2 (i.e. by recombinant baculovirus infection of stably transfected Drosophila S2 cells) and Spodoptera Sf9 cells. In some embodiments, the proteins of the invention are not produced in insect cells. There are many plant cell culture and whole plant genetic expression systems known in the art. Examples of suitable plant cellular genetic expression systems include those described in U.S. Pat. Nos. 5,693,506; 5,659,122; 5,608,143. In particular, all plants from which protoplasts can be isolated and cultured to give whole regenerated plants can be utilised, so that whole plants are recovered which contain the transferred gene. Practically all plants can be regenerated from cultured cells or tissues, including but not limited to all major species of sugar cane, sugar beet, cotton, fruit and other trees, legumes and vegetables.
Examples of prokaryotic expression systems include those that use streptococci, staphylococci, E. coli, Streptomyces and Bacillus subtilis as host cells.
Examples of fungal expression systems include those that use yeast (for example, S. cerevisiae) and Aspergillus as host cells.
HEK293 cells are suitable for transient expression of the HCMV proteins of the invention due to their high transfectability by various techniques, including the calcium phosphate and polyethylenimine (PEI) methods. A useful cell line of HEK293 is one that expresses the EBNA1 protein of EBV, such as 293-6E (Loignon et al., Stable high volumetric production of glycosylated human recombinant IFNalpha2b in HEK293 cells, 2008 BMC Biotechnology 8:65). Transformed HEK293 cells have been shown to secrete high levels of the protein complexes of the invention into the growth medium, thus allowing the purification of such protein complexes directly from the growth medium.
CHO cells are particularly suitable mammalian hosts for industrial production of the HCMV proteins of the invention for use as immunogens or antigens because they allow long-term, stable gene expression and high yields of proteins.
In some embodiments, the mutant HCMV polypeptide(s) or mutant complex of the invention is secreted from the cells in which they are expressed. In other embodiments of the invention, the mutant polypeptide or mutant complex of the invention is not secreted. In E. coli, for example, non-secreted proteins may accumulate in inclusion bodies. Methods for purifying recombinant proteins from inclusion bodies are well known in the art.
Transfection can be carried out by a range of methods including using calcium phosphate, electroporation, or by mixing a cationic lipid with the material to produce liposomes which fuse with the cell membrane and deposit their cargo inside.
The invention provides recombinant nucleic acid molecules having a nucleotide sequence which encode the mutated HCMV gH polypeptides, the mutated HCMV gL polypeptides, the mutated HCMV pUL128 polypeptides, the mutated HCMV pUL130 polypeptides and/or the mutated HCMV pUL131A polypeptides described herein. The recombinant nucleic acid molecules of the present invention may be within a vector (an expression vector, for example) and may be operably linked to one or more control element (a promoter and/or an enhancer, for example). An example of said recombinant nucleic acid may be a single molecule which encodes a gL polypeptide of the invention, a gH polypeptide of the invention, a pUL128 polypeptide of the invention, a pUL130 polypeptide of the invention and a pUL131A polypeptide of the invention. A further example of said recombinant nucleic acid may be a single molecule which encodes a gL polypeptide of the invention and a gH polypeptide of the invention, optionally further encoding a gO polypeptide of the invention. The invention also provides a plurality of recombinant nucleic acid molecules which encode one or more mutated polypeptides of the invention. For example, in one embodiment the invention provides two nucleic acid molecules: the first molecule encoding a gH polypeptide of the invention and a gL polypeptide of the invention, and the second molecule encoding a pUL128 protein of the invention, a pUL130 protein of the invention and a pUL131A polypeptide of the invention. For example, in one embodiment the invention provides three nucleic acid molecules: a first recombinant nucleic acid molecule which encodes a gL protein of the invention; a second recombinant nucleic acid molecule which encodes a gH protein of the invention; and a third recombinant nucleic acid molecule which encodes one or more additional HCMV proteins such as gO, pUL128, pUL130, or pUL131A. For example, in one embodiment the invention provides five nucleic acid molecules: a first recombinant nucleic acid molecule which encodes a gL protein of the invention; a second recombinant nucleic acid molecule which encodes a gH protein of the invention; a third recombinant nucleic acid molecule which encodes a pUL128 protein of the invention; a fourth recombinant nucleic acid molecule which encodes a pUL130 protein of the invention; and a fifth recombinant nucleic acid molecule which encodes a pUL131A protein of the invention. Preferably, the recombinant nucleic acid molecules of the present invention (a) is/are not a self-replicating RNA molecule; (b) is/are not (an) alphavirus replicon(s); (c) do(es) not encode any alphavirus nonstructural proteins, such as NSP1, NSP2, NSP3 and NSP4; (d) do(es) not contain an Internal Ribosomal Entry Site (IRES), such as EMCV or EV71; and/or (e) do(es) not contain a viral 2A site, such as FMDV. Thus, the sequences encoding each individual polypeptide in a complex can be present in a single nucleic acid molecule, or distributed among two or more nucleic acid molecules.
In one embodiment, the invention provides a plurality of recombinant nucleic acids comprising: (i) a first recombinant nucleic acid molecule which encodes a gL protein of the invention, (ii) a second recombinant nucleic acid molecule which encodes a gH protein of the invention, (iii) a third recombinant nucleic acid molecule which encodes a pUL128 protein of the invention, (iv) a fourth recombinant nucleic acid molecule which encodes a pUL130 protein of the invention, and (v) a fifth recombinant nucleic acid molecule which encodes a pUL131A protein of the invention. See, e.g., WO 2014/005959 (also published as U.S. Pre-grant Pub. No. 2016-0159864. Preferably, said first, second, third, fourth and/or fifth recombinant nucleic acid molecule(s): (a) is/are not a self-replicating RNA molecule; (b) is/are not (an) alphavirus replicon(s); (b) do(es) not encode any alphavirus nonstructural proteins, such as NSP1, NSP2, NSP3 and NSP4; (c) do(es) not contain an Internal Ribosomal Entry Site (IRES), such as EMCV or EV71; and/or (d) do(es) not contain a viral 2A site, such as FMDV.
Nucleic acid molecules which encode a gH protein of the invention can have various degrees of identity to SEQ ID NO: 1 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 1. Nucleic acid molecules which encode a gH protein of the invention can have various degrees of identity to SEQ ID NO: 3 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 3. Nucleic acid molecules which encode a gL protein of the invention can have various degrees of identity to SEQ ID NO: 7 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 7. Nucleic acid molecules which encode a gL protein of the invention can have various degrees of identity to SEQ ID NO: 9 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 9. Nucleic acid molecules which encode a gL protein of the invention can have various degrees of identity to SEQ ID NO: 29 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 29. Nucleic acid molecules which encode a pUL128 protein of the invention can have various degrees of identity to SEQ ID NO: 13 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 13. Nucleic acid molecules which encode a pUL130 protein of the invention can have various degrees of identity to SEQ ID NO: 17 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 17. Nucleic acid molecules which encode a pUL131A protein of the invention can have various degrees of identity to SEQ ID NO: 21 such as at least 89% 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to the sequence of SEQ ID NO: 21.
The recombinant nucleic acid molecules of the invention may comprise DNA, optionally including introns, and/or cDNA. Some genes are expressed more efficiently when introns are present. Genomic UL128 and UL131A genes each consist of two exons, whereas UL130 does not contain any introns. The recombinant nucleic acid molecule of the invention may comprise ribonucleic acid (RNA), including mRNA, with the proviso that the RNA molecule of the present invention (a) is/are not a self-replicating RNA molecule; (b) is/are not (an) alphavirus replicon(s); (c) do(es) not encode any alphavirus nonstructural proteins, such as NSP1, NSP2, NSP3 and NSP4; (d) do(es) not contain an Internal Ribosomal Entry Site (IRES), such as EMCV or EV71; and/or (e) do(es) not contain a viral 2A site, such as FMDV. The nucleic acid molecules of the invention may comprise polynucleotide sequences (DNA or RNA) that have been codon optimized for expression within a host cell, for example, codon optimized for expression within a bacterial or mammalian host cell.
The invention provides vectors that comprise the nucleic acid molecules of this invention. A vector of this invention may be an expression vector comprising promoters and terminators suitable for expression within a host cell. Such promoters and terminators have been described by, for example, U.S. Pre-grant Pub. Nos. 2015/0322115 and 2015/0359879. Said recombinant nucleic acid molecules may be plasmids, or may be incorporated into the genome of a cell. The promoters in these vectors can be HCMV promoters or non-HCMV promoters (see, e.g., U.S. Pre-grant Pub. Nos. 2015/0322115 and 2015/0359879).
Exemplary procedures sufficient to guide one of ordinary skill in the art through the production of recombinant HCMV nucleic acids of the invention can be found in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 1989; Sambrook et al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Press, 2001; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates, 1992 (and Supplements to 2003); and Ausubel et al., Short Protocols in Molecular Biology: A Compendium of Methods from Current Protocols in Molecular Biology, 4th ed., Wiley & Sons, 1999.
The invention also provides a process for expressing a HCMV complex (e.g., a HCMV pentameric complex) comprising one or more mutated HCMV gH, gL, pUL128, pUL13 and/or pUL131A polypeptides of the invention by introducing one or more recombinant nucleic acid molecules which encode said one or more mutated polypeptides into an expression system; expressing said one or more nucleic acid molecules in said expression system; and purifying said HCMV complex. In some embodiments, this process comprises transfecting cells with a first nucleic acid construct which encodes: mutated HCMV gH, gL, pUL128, pUL130 and pUL131A polypeptides of the invention. In some embodiments, this process may comprise transfecting cells with a first nucleic acid construct which encodes a HCMV gH polypeptide of the invention, a second nucleic acid construct which encodes a HCMV gL polypeptides of the invention; and one or more third nucleic acid construct(s) which encode(s) one or more additional HCMV glycoprotein(s) of the invention. In some embodiments, this process may comprise transfecting cells with a first nucleic acid construct which encodes a gH polypeptide of the invention and a gL polypeptide of the invention; and a second nucleic acid construct which encodes a HCMV pUL128, a HCMV pUL130 and a HCMV pUL131A polypeptide of the invention. Said HCMV complex may be expressed in a mammalian cell. Said isolated HCMV membrane protein complex may optionally be purified.
The invention also provides a cell that expresses a nucleic acid molecule or plurality of nucleic acid molecules of the invention, wherein said cell does not comprise the full HCMV genome. Said cell may be stably transformed with said nucleic acid molecule or plurality of nucleic acid molecules of the invention. Preferably, said cell is a mammalian cell, for example a 293-6E cell (see WO 2014/005959, also published as U.S. Pre-grant Pub. No. 2016/0159864) or a CHO cell (see WO 2016/116904).
The invention also provides a cell that produces a complex of the present invention, wherein the cell does not (i) contain an HCMV genome, and/or (ii) produce HCMV virions, and/or (iii) express any non-envelope HCMV proteins. Ideally the cell lacks one of (i), (ii) or (iii); preferably, it lacks two; more preferably, it lacks all three of (i), (ii) and (iii). It may therefore be specified that a cell of the present invention does not contain the HCMV genome and/or does not produce HCMV virions and/or does not express any non-envelope HCMV proteins.
While their structure is distinct from non-mutant polypeptides, the modified polypeptides of the present invention (and mutant, construct-forming fragments thereof) maintain immunogenic properties or epitope(s), so it is a further object of the present invention to utilize the mutant polypeptides and mutant fragments thereof in polypeptide/antibody interactions. Polypeptide/antibody interactions of non-mutant polypeptides have been described by, for example, WO 2014/005959 (also published as U.S. Pre-grant Pub. No. 2016/0159864); Macagno et al., Isolation of Human Monoclonal Antibodies that Potently Neutralize Human Cytomegalovirus Infection by Targeting Different Epitopes on the gH/gL/UL128-131A complex, 2010 J. Virol. 84(2): 1005-1013; and Gerna et al., Monoclonal antibodies to different components of the human cytomegalovirus (HCMV) pentamer gH/gL/pUL128L and Trimer gH/gL/gO as well as antibodies elicited during primary HCMV infection prevent epithelial cell syncytium formation, 2016 J. Virol. 90(14): 6216-6223. See also U.S. Pat. No. 9,527,902. Prior to the present invention, a range of conformational epitopes for the pentameric complex were known. For example, Macagno et al. (2010 J. Virol. 84: 1005-1013) isolated a panel of human monoclonal antibodies that neutralized HCMV infection of endothelial, epithelial, and myeloid cells. With the single exception of an antibody that bound to a conserved epitope in the UL128 gene product, all other antibodies bound to conformational epitopes that required expression of two or more proteins of the gH/gL/UL128-131A complex. Preferably, the pentameric complexes of the invention possess one or more of the conformational epitopes identified by Macagno et al. (2010 J. Virol. 84: 1005-1013) and/or further described by the Examples herein.
The invention provides antibodies which recognise a modified HCMV gH, gL, pUL128, pUL130, or pUL131A polypeptide; or complex of the present invention. The antibodies of the invention may have been raised using an isolated polypeptide, or isolated complex comprising it, of the invention as an antigen. Preferably, the antibodies of the invention are neutralizing antibodies. The antibodies of the present invention may be a monoclonal antibody, polyclonal antibody, multispecific antibody (e.g., bispecific antibodies), labelled antibody, or antibody fragment so long as they exhibit the desired antigen-binding activity. An “antibody fragment” or “antigen-binding fragment” refers to a molecule other than an intact antibody that comprises a portion of an intact antibody that binds the antigen to which the intact antibody binds. Examples of antibody fragments include but are not limited to Fv, Fab, Fab′, Fab′-SH, F(ab′)2; diabodies; linear antibodies; single-chain antibody molecules (e.g. scFv); and multispecific antibodies formed from antibody fragments. Papain digestion of antibodies produces two identical antigen-binding fragments, called “Fab” fragments, each with a single antigen-binding site, and a residual “Fc” fragment, whose name reflects its ability to crystallize readily. Pepsin treatment yields an F(ab′)2 fragment that has two antigen-combining sites and is still capable of cross-linking antigen.
Alternatively, an HCMV polypeptide or HCMV complex of the invention may be used to identify antibodies using in vitro selection methods, such as phage display using diverse antibody libraries. The invention also provides a method for raising antibodies using an isolated HCMV polypeptide or HCMV complex of the invention. Antibodies of the invention may be human or humanised antibodies. The antibodies of the invention may be used in a diagnostic assay and may be labelled directly or indirectly. In some embodiments, the antibodies of the invention may be used in therapy, for example in the treatment of HCMV infection and may be in the form of neutralizing antibodies, which can inhibit or neutralize the antigen's biological activity.
Complexes of the invention are preferably prepared and used in isolated form. The term “isolated” as used herein means removed from its natural environment. Hence, an “isolated HCMV membrane protein complex” does not encompass the HCMV membrane protein complex on the surface of HCMV infected cells or within an infectious HCMV virion or bound to an antibody (or antibody fragment). Using the expression methods described in the examples and, for example, WO 2014/005959 (also published as U.S. Pre-grant Pub. No. 2016/0159864, and WO 2016/116904), the complexes of the invention can be produced at high yields. For example, in processes involving growing cells of the invention in growth medium, the protein complex of the invention may accumulate to a level of more than 0.4 mg per litre of growth medium (e.g. 0.45, 0.5, 0.55, 0.6, 0.65, 0.7, 0.75, 0.8, 0.85, 0.9, 0.95, 1.0, 1.2, 1.4, 1.6, 1.8, 2, 2.5, 3, 3.5, 4, 4.5 or 5 mg per litre of growth medium or more).
The invention provides processes for purifying HCMV membrane complexes of the invention. Such processes of the invention allow for production of the HCMV membrane protein complex at a purity of >85%, >86%, >87%, >88%, >89%, >90%, >91%, >92%, >93%, >94% or >95% of total protein by mass, as determined by gel electrophoresis. These high levels of purity make the complexes suitable for use as an immunogen in diagnostic applications or as an antigen in vaccine formulations. The HCMV membrane protein complex of the invention can be prepared at various levels of purity e.g. at least 80%, 85%, 90%, 95%, or 99% of total protein by mass, i.e. the complex makes up at least 80% of the total proteinaceous mass in a composition. The composition may be free from polyacrylamide.
The invention provides a process for purifying an HCMV complex (e.g., an HCMV pentamer complex) of the invention. In an embodiment of the invention, said purification comprises one or more chromatographic steps. Said one or more chromatographic steps comprises affinity chromatography, such as Ni2+ affinity chromatography and/or size exclusion chromatography. In an embodiment of the invention, said one or more chromatographic steps comprises ion exchange chromatography. See, e.g., WO 2014/005959 (also published as U.S. Pre-grant Pub. No. 2016/0159864); WO 2016/116904; and WO2015/181142. A polypeptide of the present invention may therefore comprise a tag (e.g., an affinity tag such as a strep tag, myc tag, polyhistidine tag, or combinations thereof) for, for example, isolation of the polypeptide (see Kimple et al., Overview of Affinity Tags for Protein Purification, 2015 Curr. Protoc. Protein. Sci. 73: Unit-9.9. doi:10.1002/0471140864.ps0909s73, summarizing known tags and their use for biotechnology applications).
Vaccine and immunogenic compositions of the invention may comprise an adjuvant in addition to the antigen. Adjuvants are “non-antigen components” used in vaccines in order to enhance and modulate the immune response to the antigen. However, adjuvants can result in increased reactogenicity. adjuvants include (but are not limited to) AS01, oil-in-water emulsions (for example MF59, and AS03), liposomes, saponins, TLR2 agonists, TLR3 agonists, TLR4 agonists, TLR5 agonists, TLR6 agonists, TLR7 agonists, TLR8 agonists, TLR9 agonists, aluminium salts, nanoparticles, microparticles, ISCOMS, calcium fluoride and organic compound composites or combinations thereof. See, e.g., U.S. Pre-grant Pub. No. 2015/0093431 and WO2011/027222 (also published as U.S. Pre-grant Pub. No. 2012/0237546). In a particular embodiment, the vaccine or immunogenic composition of the invention comprises an antigen and an adjuvant wherein the adjuvant is AS01, an oil-in-water emulsion (e.g., MF59, and AS03 and their respective subtypes including subtypes B and E), an aluminum salt (e.g., aluminum phosphate and aluminum hydroxide), a saponin (e.g. QS21), an agonist of Toll-like receptors (TLRa) (e.g., TLR4a and TLR7a), or a combination thereof (e.g., Alum-TLR7a (Buonsanti et al., Novel adjuvant Alum-Tlr7a significantly potentiates immune response to glycoconjugate vaccines, 2016 Sci. Rep. 6:29063 (DOI: 10.1038/srep29063)). By “TLR agonist” it is meant a component which is capable of causing a signaling response through a TLR signaling pathway, either as a direct ligand or indirectly through generation of endogenous or exogenous ligand (Sabroe et al, Toll-like Receptors in Health and Disease: Complex Questions Remain, 2003 J. Immunol. 171(4): 1630-1635). A TLR4 agonist, for example, is capable of causing a signalling response through a TLR-4 signalling pathway. A suitable example of a TLR-4 agonist is a lipopolysaccharide, suitably a non-toxic derivative of lipid A, particularly monophosphoryl lipid A or more particularly 3-Deacylated monophoshoryl lipid A (3D-MPL). The adjuvants described herein may be combined with any of the antigen(s) herein described.
An HCMV gH polypeptide, or complex-forming fragment thereof, comprising a mutation at a location determined with respect to the sequence SEQ ID NO: 1 and that is:
A102W, A102F, A102Y, A102L, A372W, A372F, A372Y, A372L, A352W, A352F, A352Y, A352L, L257W, L257F, L257Y, L257L,
H252W, H252F, H252M, H252C, H252A, H252L, H252I, H252V, H252P, H252Y, K404W, K404F, K404M, K404C, K404A, K404L, K404I, K404V, K404P, K404Y, R255W, R255F, R255M, R255C, R255A, R255L, R255I, R255V, R255P, R255Y, E355W, E355F, E355M, E355C, E355A, E355L, E3551, E355V, E355P, E355Y, H480W, H480F, H480M, H480C, H480A, H480L, H480I, H480V, H480P, H480Y, S601W, S601F, S601M, S601C, S601A, S601L, S601I, S601V, S601P, S601Y, R405W, R405F, R405M, R405C, R405A, R405L, R405I, R405V, R405P, R405Y,
G358S, G358T, G358C, G358Y, G358N, G358Q, G358R, G358E, G358K, G358H, G358D, H275S, H275T, H275C, H275Y, H275N, H275Q, H275R, H275E, H275K, H275H, H275D,
V109C, L111C,
N55Q, N55S, N55T, N55A, N55E, N55D, N62Q, N62S, N62T, N62A, N62E, N62D, N67Q, N67S, N67T, N67A, N67E, N67D, N192Q, N192S, N192T, N192A, N192E, N192D, N641Q, N641S, N641T, N641A, N641E, N641D, N700Q, N700S, N700T, N700A, N700E, N700D or a combination thereof.
An HCMV gL polypeptide, or complex-forming fragment thereof, comprising a mutation at a location determined with respect to the sequence SEQ ID NO: 7 and that is:
H177W, H177F, H177Y, H177L, G224W, G224F, G224Y, G224L, G140W, G140F, G140Y, G140L, G145W, G145F, G145Y, G145L, D146W, D146F, D146Y, D146L, G218W, G218F, G218Y, G218L, L119W, L119F, L119Y, L119L, C233W, C233F, C233Y, C233L, P272W, P272F, P272Y, P272L,
H267W, H267F, H267M, H267C, H267A, H267L, H267I, H267V, H267P, H267Y, H236W, H236F, H236M, H236C, H236A, H236L, H2361, H236V, H236P, H236Y, H245W, H245F, H245M, H245C, H245A, H245L, H245I, H245V, H245P, H245Y, G161W, G161F, G161M, G161C, G161A, G161L, G1611, G161V, G161P, G161Y, C233W, C233F, C233M, C233C, C233A, C233L, C2331, C233V, C233P, C233Y,
G161C, D163C, G224C, G218C, R166C, G140C, R160C, A150C,
N74Q, N74S, N74T, N74A, N74E, and N74D or a combination thereof.
A HCMV pUL128 polypeptide, or complex-forming fragment thereof, comprising a mutation at a location determined with respect to the sequence SEQ ID NO: 13 and that is:
G123W, G123F, G123Y, G123L, V77W, V77F, V77Y, V77L, L103W, L103F, L103Y, L103L, Q119W, Q119F, Q119Y, Q119L,
G145W, G145F, G145M, G145C, G145A, G145L, G1451, G145V, G145P, G145Y, H90W, H90F, H90M, H90C, H90A, H90L, H90I, H90V, H90P, H90Y, G112W, G112F, G112M, G112C, G112A, G112L, G1121, G112V, G112P, G112Y,
R142C, N99C, Y98C, A124C, G126C, L159C, D45C, V88C, M48C, G107C, R51C, D106C, S83C, or a combination thereof.
A HCMV pUL130 polypeptide, or complex-forming fragment thereof, comprising a mutation at a location determined with respect to the sequence SEQ ID NO: 17 and that is:
D165W, D165F, D165Y, D165L, H209W, H209F, H209Y, H209L,
G116W, G116F, G116M, G116C, G116A, G116L, G1161, G116V, G116P, G116Y, G135W, G135F, G135M, G135C, G135A, G135L, G1351, G135V, G135P, G135Y, H150W, H150F, H150M, H150C, H150A, H150L, H150I, H150V, H150P, H150Y, H209W, H209F, H209M, H209C, H209A, H209L, H2091, H209V, H209P, H209Y,
G116C, H150C, P64C, S178C, P62C, E95C, Y204C, N211C, I213C, Y56C, T167C,
N85Q, N85S, N85T, N85A, N85E, N85D, N118Q, N118S, N118T, N118A, N118E, N118D, N201Q, N201S, N201T, N201A, N201E, and N201D or a combination thereof.
A HCMV pUL131A polypeptide, or complex-forming fragment thereof, comprising a mutation at a location determined with respect to the sequence SEQ ID NO: 21 and that is:
G99W, G99F, G99Y, G99L, S86W, S86F, S86Y, S86L, S90W, S90F, S90Y, S90L,
H69W, H69F, H69M, H69C, H69A, H69L, H69I, H69V, H69P, H69Y, H35W, H35F, H35M, H35C, H35A, H35L, H35I, H35V, H35P, H35Y, H64W, H64F, H64M, H64C, H64A, H64L, H641, H64V, H64P, H64Y, D38W, D38F, D38M, D38C, D38A, D38L, D381, D38V, D38P, D38Y, V85W, V85F, V85M, V85C, V85A, V85L, V85I, V85V, V85P, V85Y, Y52W, Y52F, Y52M, Y52C, Y52A, Y52L, Y52I, Y52V, Y52P, Y52Y, A67W, A67F, A67M, A67C, A67A, A67L, A67I, A67V, A67P, A67Y,
R118S, R118T, R118C, R118Y, R118N, R118Q, R118R, R118E, R118K, R118H, R118D,
H64C, W37C,
N81Q, N81A, N81T, N81A, N81E, and N81D or a combination thereof.
Many modifications and variations of the present invention are possible in light of the above teachings. It is therefore to be understood that, within the scope of the appended claims, a person with skill in the art would recognize that the invention may be practiced otherwise than as specifically described. The illustrative embodiments and examples should not be construed as limiting the invention.
Expression of wild type or selenomethionine-labelled (SeMet)-Pentamer was carried out in 293GnTi− cells through transient or stable transfection of two vectors, with one vector encoding gH and gL (having the sequences SEQ ID NOs: 3 and 7, respectively), and the other encoding pUL128, pUL130 and pUL131A (having the sequences SEQ ID NOs: 13, 17, and 21, respectively) two-vector strategy as outlined in Hofmann et al. (Expression of the Human Cytomegalovirus Pentamer Complex for Vaccine use in a CHO System, 2015 Biotech. & Bioeng. 112(12): 2505-2515).
For anti-Pentamer Fab expression, the heavy chain Fab fragment and the full length light chain were each cloned into the mammalian pRS5a expression vector (Novartis AG). A cleavable double Strep-tag was present on the C-terminus of the heavy chain Fab. Anti-Pentamer Fab was expressed transiently in 293Expi cells (Invitrogen Inc.) using Expifectamine 293 transfection kit (ThermoFisher) according to manufacturer's recommendations by transfecting the two vectors encoding the Fab fragment of the heavy chain and the full length light chain of the antibody in a 1:1 ratio.
For Pentamer purification, expression medium was loaded directly onto a StrepTrap HP column (GE Lifesciences) and the protein was eluted according to manufacturer's recommendations using 0.1M Tris pH 8.0, 150 mM NaCl and 2.5 mM desthiobiotin in the elution buffer (IBA Lifesciences). The eluate was incubated with TEV protease (ThermoFisher) overnight at 4 C. The sample was diluted 3-fold with 20 mM Hepes pH 7 to lower total salt concentration to 50 mM prior to loading onto MonoS10/30 column (GE Lifesciences) for ion exchange chromatography. The protein was eluted off the column with a linear gradient of 0 to 1 M NaCl over 10 column volumes. The protein was concentrated and loaded onto a Superose 6 10/300 column and the eluted peak was concentrated to greater than 1 mg/mL.
For Fab purification, expression medium was concentrated 10-fold and buffer exchanged into 25 mM Tris pH 8.0, 150 mM NaCl and 1 mM EDTA using a tangential flow filtration system (Millipore) with a 10 kDa cutoff. The sample was subsequently loaded onto a StrepTrap HP column and eluted similar to the Pentamer purification described above. The Strep tag was cleaved off using PreScission protease (GE Lifesciences) according to manufacturer's recommendations. The cleaved Fab was then purified by size exclusion chromatography over an S200 column (GE Lifesciences) equilibrated with buffer containing 25 mM Tris pH 8.0 and 150 mM NaCl.
For Pentamer-Fab complex purification for crystallization, the Pentamer eluted from the MonoS column was incubated with a 2-fold molar excess of purified Fab and incubated for at least 15 minutes at room temperature. The sample was subsequently loaded over a Superose 6 10/300 column to separate the Pentamer-Fab complex from excess Fab. Peak fractions were pooled and concentrated to >5 mg/mL.
For crystallization experiments purified wild type or selenomethione-labelled (SeMet)-Pentamer was deglycosylated using Endo Hf (New England Biolabs), according to the manufacturer's guidelines, prior to complex formation with Fabs. Initial crystal hits were obtained for a complex between Pentamer and the fragment antigen binding (Fab) of monoclonal antibody (mAb) 8I21 and 9I6 (Macagno et al. (2010 J. Virol. 84:1005-1013), and these appeared as small microcrystals in a drop containing 0.1 μl protein and 0.1 μl of 20% ethanol, 0.1 M Tris pH 8.5 at 20° C. Ethanol was replaced with the less volatile isopropanol in subsequent experiments, and benzamidine was used as additive (from the Hampton Research additive screening) in order to optimize this crystallization condition. The best crystals for WT or SeMet-Pentamer-8I21 Fab crystals were obtained using a reservoir containing 10% (wt/vol) PEG400, 10% isopropanol, 2% (wt/vol) benzamidine and 0.1 M Tris pH 8.2. Crystal hits for Pentamer in complex with the 9I6 Fab were initially obtained in 0.1 M MES pH 6.5 and 15% (wt/vol) PEG methylether 500. These crystals were optimized for growth with various additives. The best diffracting crystal was obtained using a reservoir containing 10% (wt/vol) PEG methyl ether 500, 0.1 M MES pH 6.2 and 10 μM phenol.
3.1 Results—Crystal Structure
The Pentamer structure adopts a helicoid-shape 180 Å in length and 30-80 Å in cross-over (
Although structural comparisons reveal a close similarity of HCMV gH with gH of the γ-herpesvirus Epstein Barr Virus (EBV), the HCMV gH adopts a boot shape, reminiscent of the α-herpesviruses Varicella Zoster Virus (VZV) and Herpes Simplex Virus-2 (HSV-2) gH/gL, rather than the rod-like conformation of EBV gH/gL. A significant difference between EBV and HCMV gH is the presence in the latter of three additional N-terminal β-strands that interact with residues from gH.
The ULs form a central core domain flanked at opposite ends by two small globular domains (
10P3 (site 4) and 15D8 (site 1) bind into the concave surface of the ULs, and 10F7 (site 2) binds on the other face of the Pentamer along the UL130/UL131A β sheet.
9I6 CDRs contact both UL128 (residues 47-52 on the chemokine domain and residues 92-93 and 106-109 on α2 β4 β5 β6) and UL131A (residues 23-24 and 27-31). The epitope is consistent with published NS-EM data and mapping studies suggesting that site 5 antibodies require all three ULs for binding, likely due to the co-folding of UL130 and UL131A.
The main feature of the Pentamer-8I21 Fab complex is the interaction between the long HCDR3 of the Fab and the UL130 chemokine domain. Arg104 and Trp105, at the tip of HCDR3, protrude into a crevice composed of hydrophobic and polar residues from the N-terminal UL130-α1 helix and UL130/gL β-sheet establishing H-bonds with UL130 Ser47 and gL Asp156, respectively. Mutation of HCDR3 Trp105 to alanine resulted in an over 150-fold decrease in binding affinity consistent with its prominent role in the interaction. Several other interactions stabilize the complex including hydrogen bonds between the guanidinium group of HCDR3 Arg107 and the hydroxyl group of UL130 Tyr46, and H-bonds between main chain carbonyl oxygens of HCDR3 and the side chains of LCDR3 Trp94 and Trp97.
In Summary:
The Pentamer structure reveals the presence of small interfaces between some of the domains, as well as the presence of several cavities at the domain interfaces (
The Pentamer structure revealed herein shows that the gH and gL components of the complex have a close structural similarity with EBV gH/gL while the ULs are characterized by an α/β core domain flanked at opposite ends by UL128 and UL130 chemokine domains. Notably, HCMV gL has a unique N-terminal extension, missing in EBV and HSV gLs, which forms a docking site for the UL128 C-terminal α3 helix and the UL130 chemokine domain.
Characteristic features of the Pentamer structure include relatively small interfaces and cavities between some domains, suggesting intrinsic flexibility of the complex. Structure comparisons revealed large rigid body rotations of the gH/gL D-I domain and ULs arm around the gH D-I/D-II linker-helix resulting in a large displacement of the ULs. Though the 8I21 epitope remains the same in the Pentamer-9I6 Fab complex, it is believed that antibody binding stabilizes Pentamer in discrete, yet different, conformational states. Indeed, single particle reconstructions reveal a large rigid body rotation of the ULs in Pentamer bound to 10F7 Fab compared to the 8I21 and 9I6 Fab complexes. Consistent with this belief, HDX-MS analysis of Pentamer-Fab complexes shows that antibodies stabilize regions of Pentamer distant from their corresponding epitopes. Therefore, the combination of HDX-MS and crystallographic data reveal areas of Pentamer where mutations may be introduced to generate a more stable complex (see, e.g., McLellan et al., 2013 Science 342: 592-598, showing greater stability of a pre-fusion conformation of the respiratory syncytial virus fusion protein was linked with improved immunogenicity).
This structural analysis of the Pentamer-Fab complexes together with cell binding analysis provides new insights into the mechanism of antibody-mediated HCMV neutralization. We show that Pentamer binds to adult retinal pigment epithelial cells (ARPE-19) and human umbilical vein endothelial cells (HUVEC) cells but not MRC-5 cells.
Pre-incubating Pentamer with mAbs 15D8 (site 1), 10P3 (site 4), 2C12 and 9I6 (site 5) or 7I13 (site 6), inhibited Pentamer binding to endothelial cells (
In contrast, antibody binding to the elbow of the ULs arm (4N10 and 8121, sites 3 and 7 respectively) and the solvent exposed side of the UL130/UL131A β-sheet (10F7, site 2) did not affect Pentamer binding to cells, suggesting a different mechanism of neutralization. 8I21 binds to a positively charged surface on UL130 with a long heavy chain CDR3 (HCDR3) simultaneously protruding into a groove in the UL130 chemokine domain and contacting UL130 N-terminal residues, both of which are implicated in receptor binding in chemokines. It is believed that this site in UL130 binds to a co-receptor at the cell surface or in a post-entry step. Therefore, site 2 and site 3/7 antibodies may interfere with these interactions and processes without affecting Pentamer binding to cells.
Without wishing to be bound by theory and based on the structural and functional characterizations of neutralizing mAbs, the inventors propose a potential mechanism for Pentamer-mediated activation of HCMV entry. The inventors suggest that a cell receptor would bind to a surface in proximity of UL128 and the epitopes for site 1 and 4/6 neutralizing antibodies. Receptor binding may in turn result in a rotation of gH/gL D-I mediated by the UL128 linker and UL128-3 interaction with the gL 3-helix bundle. Repositioning of D-I may affect the width of the D-I/D-II groove, implicated in gB binding in EBV gH/gL, ultimately triggering membrane fusion.
In conclusion, the herein described structure of Pentamer reveals binding sites for potent and broadly neutralizing mAbs suggesting the location of important functional sites and targets for antibody therapeutics. The structures also reveal a dynamic repositioning of the ULs upon antibody binding suggesting a mechanism of ligand-induced conformational change during cell entry. Finally, the structural, biochemical and cell-based functional analyses of HCMV Pentamer reported here provide an atomic-level framework for at least the mechanism of Pentamer activity and for antigen design.
3.2 Analysis of the Glycans that Mask Pentamer Epitopes and Deglycosylation Mutations
Seven neutralizing epitopes (sites 1 to 7) on Pentamer have previously been identified and broadly mapped (See, e.g., Macagno et al., 2010 J. Virol. 84: 1005-1013 and U.S. Pat. No. 9,527,902). Five of the sites are non-overlapping (sites 1, 2, 3, 4, and 5) but site 3 overlaps site 7 and site 4 overlaps site 6. Antibodies that bind those seven sites are also known and include, for example, the 15D8 antibody known to bind site 1, the 10F7 antibody known to bind site 2, the 4N10 antibody known to bind site 3, the 10P3 antibody known to bind site 4, the 9I6 and 2C12 antibodies known to bind site 5, the 7I13 antibody known to bind site 6, and the 8I21 antibody known to bind site 7 (See, e.g., Macagno et al., 2010 J. Virol. 84: 1005-1013 and U.S. Pat. No. 9,527,902). Using the Pentamer structure obtained as described above, the present inventors characterized site 1, 4, 5, and 2 epitopes with greater specificity by first mapping Pentamer epitope sequences (the amino acid residues of neutralizing epitopes on the Pentamer) with differential HDX incorporation upon Fab binding (15D8, 10P3, 2C12, and 10F7 Fabs were used as representative neutralizing antibodies that bind to sites 1, 4, 5, and 2, respectively). Second, the inventors manually inspected and analyzed X-ray structures, HDX-MS data, and EM fitting results to propose that the site 1 epitope sequence corresponds to pUL128 residues 149-171 numbered with respect to the sequence SEQ ID NO: 13; the site 4 epitope sequence corresponds to pUL128 residues 56-72 and 131-148 numbered with respect to the sequence SEQ ID NO: 13; the site 5 epitope sequence corresponds to pUL131A residues 31-40 and 42-56 numbered with respect to the sequence SEQ ID NO: 21; and that the site 2 epitope sequence corresponds to pUL131A residues 92-122 numbered with respect to the sequence SEQ ID NO: 21. The locations of each of these epitope sequences on the surface of the Pentamer structure are shown in
The general locations of eleven glycans on the surface of Pentamer are also shown in
The inventors have selected additional glycans which are in close proximity to a neutralizing epitope and that are likewise expected to limit the accessibility of their respective epitope(s): glycans at gH-Asn55, gH-Asn62, gH-Asn67, gH-Asn192, gH-Asn641, and gH-Asn700 numbered with respect to SEQ ID NO: 1; at gL-Asn74 numbered with respect to SEQ ID NO: 7; pUL130-Asn201 (which are in addition to pUL130-Asn85 and pUL130-118) numbered with respect to SEQ ID NO: 17; and pUL131A-Asn81 numbered with respect to SEQ ID NO: 21. The inventors therefore expect that by removing one or more of the above-listed glycans, the corresponding epitope(s) will be more accessible. In particular, that removing the glycan will “unmask” the epitope. The inventors specifically propose the substitution of an above-listed asparagine residue for any non-asparagine amino acid (e.g., glutamine) using known techniques so as to prevent N-linked glycosylation at that location and thereby unmask an epitope of Pentamer. Whether or not a deglycosylation mutation unmasks an epitope and/or renders the epitope more accessible may be assayed by comparing the mutant polypeptide's or mutant complex's antigenicity to that of a non-mutant polypeptide or complex using known techniques (see, e.g., Zhou et al., Quantification of the Impact of the HIV-1-Glycan Shield on Antibody Elicitation, 2017 Cell Reports 19:719-732; and Ma et al. Envelope Deglycosylation Enhances Antigenicity of HIV-1 gp41 Epitopes for Both Broad Neutralizing Antibodies and Their Unmutated Ancestor Antibodies, 2011 PLoS Path. 7(9), e1002200). Enhanced binding of a neutralizing antibody to the epitope (i.e., increased antigenicity) indicates that the deglycosylation mutation has the effect of unmasking the epitope and/or making the epitope more accessible (Id.).
Based on the known positive affect deglycosylation and the resulting unmasking of epitopes has had on the immunogenicity of various antigens, but not wishing to be bound by theory, it is believed that by unmasking an HCMV Pentamer epitope via the deglycosylation mutations described herein, the mutant HCMV polypeptide (or fragment thereof) or mutant HCMV complex will have an increased immunogenicity as compared to a non-mutant (e.g., wild type) polypeptide or complex, respectively (see Liu et al., Unmasking Stem-Specific Neutralizing Epitopes by Abolishing N-Linked Glycosylation Sites of Influenza Virus Hemagglutinin Proteins for Vaccine Design, 2016 J. of Virol. 90(19): 8496-8508; Zhou et al., 2017 Cell Reports 19:719-732; and Ma et al., 2011 PLoS Path. 7(9), e1002200). An increase in antigenicity following deglycosylation has been an acceptable proof-of-concept for increasing immunogenicity via deglycosylation (See, e.g., Ma et al., 2011 PLoS Path. 7(9), e1002200).
3.3 Computational Analysis of the Amino Acids Involved in Pentameric Complex Stability
In addition to manual inspection, analyses of the structures from section 3.1 above was performed using the molecular graphics programs Pymol (The PyMOL Molecular Graphics System, Version 1.8 Schrödinger, LLC, available at WorldWideWeb(www).pymol.org) and the Crystallographic Object-Oriented Toolkit (“Coot”) (Emsley et al., 2010, available at WorldWideWeb(www)2.mrc-lmb.cam.ac.uk/Personal/pemsley/coot/). Thereafter, the Molecular Operating Environment (MOE) software (REF: Molecular Operating Environment (MOE) software; Chemical Computing Group Inc., available at WorldWideWeb(www).chemcomp.com) was used. In particular, the “Residue Scanning” and “Sample Sequence” variants of the “Protein design” module within MOE were applied to the pentameric complex crystal structure obtained as described in the sections above. “Residue Scanning” simulation sequentially mutates all or a subset of residues of the input protein. “Sample Sequence” mutates, in a random fashion, all of the specified residues. Both single and multiple point mutations were generated using MOE.
In addition, the in silico “Disulfide Scan” variant of the Protein Design module within MOE was applied to the crystal structure of the pentameric complex obtained as described in the sections above. With this tool, the surrounding environment of a residue was analyzed to evaluate the presence of other residue(s) close enough to form a disulfide bond. In general terms, two residues were characterized as close enough to form a disulfide bond if the β carbons of the two amino acids are within 5 Å of each other. Once pairs of such residues were selected, MOE was utilized to mutate both residues to Cysteine and to characterize the resulting impact on the stability of the protein complex.
The results from each MOE protocol were compiled and manually inspected. For all MOE protocols above, the impact of each proposed mutation was scored for stability using the delta stability scoring method (indicated as dS, and measured as kcal/mol). The dS is defined in MOE as the relative thermo-stability of a certain mutation with respect to the wild type or non-mutant protein, and more negative values of dS indicate more stable mutations. This scoring method takes into account both the conformational change upon mutation and the change with respect to the wild type or non-mutant. Further, dS is predicted from the difference in stability between a mutant protein and its wild type (or the non-mutant) both in the folded and unfolded states.
4.1 Characterization of Complex Stability with Respect to at Least gH
Structural analyses of gH revealed the presence of several buried cavities mainly localized in the gH N-terminal region (near the interface with gL) and in the C-terminus of the gH ectodomain. With respect to the gH sequence SEQ ID NO: 3, residue A102 of wild type (WT) gH is located on a short α-helix (made of gH residues 100-108) that runs parallel to a gL α-helix (made of gL residues 216-234, with respect to gL sequence SEQ ID NO: 7). The methyl (CH3) side-chain group of gH A102 points towards the interface with the gL α-helix, where an apparent cavity is present. The inventors therefore suggest that a cavity filling mutation of gH A102 might contribute to better packing within this buried interface environment. In addition, gH residue H480 is observed to be partially exposed on the surface and is thus identified by the inventors as a target site for a repacking mutation to introduce more favourable packing with the residue's surrounding environment. Also, in view of the introduction of two additional disulphide bridges at the gH-gL interface (see mutations V109C(gH)-G224C(gL) and L111C(gH)-G218C(gL) at Table 11), the inventors suggest that this region be stabilized by introducing a disulfide bridge mutation and thereby locking together in a more rigid fashion gH to the gL α-helix at gL residues 216-234.
4.2 Characterization of Complex Stability with Respect to at Least gL
The localization of buried cavities within gL spans the entire N- to C-terminus buried regions. Notably, gL residue H177 (numbered with respect to gL sequence SEQ ID NO: 7) is located centrally within gL, sandwiched between three structures: one loop on its side, one loop on its bottom, and an α-helix on top. The sidechain of gL H177 points towards a cavity that covers most of the region within these two loops. Therefore, the inventors propose a mutation of H177 into a hydrophobic, cavity filling residue to stabilize this region (i.e., introducing a cavity filling mutation at gL H177). Likewise, because at gL residue G224 and its surrounding environment, a large cavity can be observed buried between a gH helix-loop-helix motif (gH residues corresponding to 108-122 of SEQ ID NO: 3) and the gL α-helix (gL residues corresponding to 216-234 of SEQ ID NO: 7), the inventors propose introducing a cavity filling mutation there (gL residue G224) to stabilize the region. Also, at the interface between the gH helix-loop-helix region at gH residues 92-101 (numbered with respect to SEQ ID NO: 3) and the gL helix-loop-helix region at about gL residues 228-242 (numbered with respect to SEQ ID NO: 7), there is a cavity anterior to the sidechain of gL C233. The inventors therefore propose introducing a cavity filling mutation or a hydrophobic mutation at gL residue C233. Finally, the gL-UL130 interface is made of main- and side-chain hydrogen bonds and hydrophobic contacts of large side chains. Based in part on structure analyses of this region, the inventors suggest the insertion of a disulphide bond into the gL-UL130 interface via a disulfide bridge mutation to lock together these two subunits (by, for example, a G161C mutation at the gL residue corresponding to G161 of SEQ ID NO: 7 with a P64C mutation at the UL130 residue corresponding to P64 of SEQ ID NO: 17).
4.3 Characterization of Complex Stability with Respect to at Least UL128
The UL128 residue corresponding to G123 of SEQ ID NO: 13 is located below a significant buried cavity and is buried in a small domain of 30-residues at the interface between pUL128 and pUL130 and pUL131A. At least because residue G123 is located below a significant buried cavity, the inventors propose making cavity filling mutations at UL128 G123 to increase complex stability. In addition, the small UL128 G145 residue (numbered with respect to SEQ ID NO: 13) is located in the final UL128 C-terminal α-helix that docks onto the gL 3-helix bundle, and is therefore an inventor-identified target for introducing a repacking mutation to increase interactions at the gL-UL128 interface and, thereby, increase complex stability. Finally, due to the environment and three-dimensional arrangement of residues UL128 R142 and UL130 E95 (numbered with respect to UL128 sequence SEQ ID NO: 13 and UL130 sequence SEQ ID NO: 17, respectively) along the 50 Å-long linker that connects the N-terminal domain of UL128 to gL, the inventors propose introducing a UL128 R142C and a UL130 E95C disulfide bridge mutation into the complex to lock UL128 to UL130 and to thus contribute to increased stability of the ULs region and increased stability of the complex generally.
4.4 Characterization of Complex Stability with Respect to at Least UL130
In the pUL130-pUL131A interface, pUL130 residue H209 (numbered with respect to pUL130 sequence SEQ ID NO: 17) points towards pUL131A H35 (numbered with respect to pUL131A sequence SEQ ID NO: 21), based on which the inventors suggest that if protonated, these histidines would repel each other causing conformational changes or conferring flexibility to this region. Therefore, the inventors propose introducing a repacking mutation into the residue corresponding to UL130 H209 to decrease flexibility of this region of the ULs and increase complex stability. Moreover, a significant buried cavity is detected near pUL130 H209 into which the inventors propose introducing either or both of a cavity-filling and repacking mutation for increased complex stability. Also in the pUL130-pUL131A interface, the pUL130 residue corresponding to H150 of the sequence SEQ ID NO: 17 points towards pUL131A H69 (numbered with respect to SEQ ID NO: 21), so the inventors likewise propose introducing a repacking mutation into pUL130 H150 to decrease flexibility of this UL region and increase complex stability.
4.5 Characterization of Complex Stability with Respect to at Least pUL131A
In the pUL130-pUL131A interface (αβ-core), pUL131A G99 (numbered with respect to pUL131A sequence SEQ ID NO: 21) lies on a peripheral strand of the flat 130-131-mixed large β-sheet, and its C-b backbone atom (i.e., the first carbon atom of the G99 side chain) points towards the buried region where pUL130 and pUL131A helices as well as a large buried cavity are located. The inventors target pUL131A G99 for the introduction of a cavity-filling mutation to enhance complex stability. Similarly, pUL131A residue S86 (numbered with respect to pUL131A sequence SEQ ID NO: 21) is located on a pUL131A helix and points towards the flat large pUL130-pUL131A mixed β-sheet. Therefore, the inventors propose introducing a cavity-filling mutation at pUL131A S86 for increased complex stability. Likewise, pUL131A residue H64 (numbered with respect to pUL131A sequence SEQ ID NO: 21) points towards the buried environment between the pUL130-pUL131A mixed αβ-core, so the inventors propose introducing a repacking mutation (specifically, a hydrophobic mutation) of pUL131A H64 to, for example, a bulky more hydrophobic residue (such as tryptophan (W)) to introduce interactions that increase thermal stability of the complex.
4.6 Designed Stabilizing Mutants
As indicated in the sections above, the inventors conducted computational analysis (summarized above at Example 3.3), processed that data by manual inspection and manipulation (summarized above at Examples 4.1-4.5), and have thereby designed mutations within gH, gL, pUL128, pUL130, and pUL131A to, when alone or combined, improve the stability of a complex comprising them. While not wishing to be bound by theory, it is believed that conformational flexibility is decreased and the over-all thermostability of a complex is increased by (A) filling buried cavities between domain interfaces with atoms from amino acids made of longer side-chains than the one found in the wild type protein, by (B) increasing contacts of neighboring residues or replacing unfavorable clusters of charged residues, and/or by (C) introducing intra- or inter-disulfide bridges throughout the structure. Exemplary inventor-designed mutants to effect one or more of A-C are listed in Tables 15-21 below.
While the below tables are organized so as to exemplify specific mutations within a particular pentamer complex subunit, the designed mutants include combinations of these mutations (“stacked mutations”, i.e., combinations of mutations within one polypeptide (e.g., several mutations within gH) as well as combinations of mutations within different polypeptides (e.g., mutant gH and mutant gL, both of which may comprise more than one mutation). By way of example, an inventor-designed mutant HCMV pentamer complex comprises any one or more mutations listed within Tables 20 and 21 below.
A102W
H177W
G123W
A372F
D165W
G99W
G224F
H209Y
S86W
G140W
G145W
V77F
S90F
A352F
L103F
Q119F
D146W
G218L
L257W
L119W
P272F
C233F
C233W
C233L
While the above tables 15-20 exemplify disulfide bridge mutations for introduction into a complex subunit (e.g., into one of the pentamer complex subunit proteins gH, gL, pUL128, pUL130, and pUL131A), because disulfide bridges are formed between two residues (two cysteine residues within the same polypeptide (intra disulfide bridge) or between two cysteine residues, one cysteine in each of a first polypeptide and a second polypeptide (inter disulfide bridge)), the inventors provide at Table 21 below a summary of combined designed disulfide bridge mutations (numbered in the left-most column as groups of disulfide bridge mutations) to increase the stability of a complex comprising them. The combinations of disulfide bridge mutations provided in Table 21 may be stacked (for example, the mutations listed in Group No. 1 may be combined with the mutations listed in Group No. 2) or combined with at least one cavity-filling mutation and/or repacking mutation listed in Tables 15-20 above.
4.7 Express and Purify a Complex Comprising at Least One Designed Mutant
A HCMV pentameric complex shows bimodal unfolding, which is reflected by it having two thermal transition midpoint (Tm) values (two peaks) in a differential scanning fluorimetry (DSF) assay (the first, lower temperature value being referred to as “Tm1” and the second, higher temperature value being referred to as “Tm2”). A cavity-filling, repacking, disulfide bridging, or deglycosylation mutation of the present invention may shift (i.e., increase or decrease) one or several Tm peaks of a complex. Increasing a Tm value (i.e. shifting a Tm peak toward the right in the resulting sigmoidal graph of a DSF assay) is denoted with a temperature change greater than zero (a positive Tm shift). Decreasing a Tm value (i.e., shifting a Tm peak toward the left) is denoted with a temperature change less than zero (a negative Tm shift). For an HCMV pentamer complex, a mutation of the present invention may shift just one, or both, of Tm1 and Tm2. For these experiments, whether or not a mutation, or combination of mutations, is Stabilizing, Neutral, or Destabilizing depends on the size of the Tm1 shift, Tm2 shift, or both. Increasing at least one of Tm1 and Tm2 by at least 2° C. as compared to control is considered an increase in the thermostability of the HCMV complex (i.e., the mutation or combination of mutations are said to be “Stabilizing” if they result in a Tm1 shift or Tm2 shift of at least 2° C. as compared to control). Increasing at least one of Tm1 and Tm2 by at least 5° C. as compared to control is considered a significant increase in the thermostability of the complex (i.e., the mutation or combination of mutations are said to be “Significantly Stabilizing” if they result in a Tm1 shift or Tm2 shift of at least 5° C. as compared to control). When both of the Tm1 shift and Tm2 shift values are between 2° C. and −2° C. (exclusive of endpoints) as compared to control, the mutation or combination of mutations are said to be “Neutral.” When at least one of the Tm1 shift and Tm2 shift values is −2° C. or less as compared to control, the mutation or combination of mutations are said to be “Destabilizing.” Here, if the Tm1 shift, Tm2 shift, or both and/or any effect on stability could not be evaluated due to, for example, insufficient expression or amount of protein in the assay, the mutant or combination of mutants are labelled “Inconclusive.”
Modified HCMV pentameric complexes were expressed and purified as described in Example 1 and 2 above. The HCMV pentameric complexes each comprised at least one modified subunit polypeptide gH, gL, UL128, UL130, and pUL131A, wherein the modified subunit polypeptide comprised the mutations as listed in Tables 22-23. Purified, mutant HCMV pentamer complex was assessed in a differential scanning fluorimetry (DSF) assay, using a Nanotemper Prometheus NT.48 instrument. The intrinsic fluorescence of aromatic residues, such as Tyr and Trp, was obtained by exciting at 280 nm wavelength and measuring emission spectra at 330 nm (representing the folded state) and 350 nm (representing the unfolded state) over a temperature ramp (25-85° C.). The instrument software was used to plot the differential of the fluorescence ratio (350 nm/330 nm), and the temperature corresponding to the inflection point of the curve was taken as the melting temperature of the mutant HCMV pentamer complex (the Tm). The Tm1 and Tm2 of each mutant HCMV pentamer complex was compared to the corresponding Tm1 or Tm2 of a control (non-mutant) HCMV pentamer complex. The control complex utilized was a HCMV pentamer complex comprising a gH complex-forming fragment and a gL polypeptide having mutations within a protease recognition site (specifically, the gH polypeptide comprised the sequence SEQ ID NO: 4; the gL polypeptide comprised the sequence SEQ ID NO: 10; the pUL128 polypeptide comprised the sequence SEQ ID NO: 14; the pUL130 polypeptide comprised the sequence SEQ ID NO: 18; and the pUL131A polypeptide comprised the sequence SEQ ID NO: 22). Any change (shift) in Tm1 and/or Tm2 as compared to control was calculated, if possible, and is summarized in Tables 22 and 23. The residue numbers in Tables 22 and 23 are with respect to gH sequence SEQ ID NO: 3, gL sequence SEQ ID NO: 7, pUL128 sequence SEQ ID NO: 13, pUL130 sequence SEQ ID NO: 17, and pUL131A sequence SEQ ID NO: 21, respectively. An increase of at least 5° C. in the calculated Tm is considered a significant increase in the thermostability of the complex. The stabilizing mutant HCMV pentamer complexes described in Tables 22 and 23 were assayed for the presence of wild type conformational epitopes using Bio-layer Interferometry (BLI) technology and the 10P3 antibody. All complexes listed in Table 22 and 23 maintain the wild type HCMV pentamer complex conformational epitope recognized by the 10P3 antibody.
4.8 Effect of Designed Mutations on Stability of the Pentameric Complex
While some of the mutant HCMV pentamer complexes produced by these methods were more thermostable than the control, others were neutral in effect or destabilizing. Mutant HCMV pentamer complexes comprising disulfide mutations within gL and pUL130 polypeptides; pUL128 and pUL130 polypeptides; or within just the pUL128 polypeptide had an increased thermostability as compared to control (Table 22). Likewise, mutant HCMV pentamer complexes comprising a cavity filling mutation in the gL polypeptide; pUL128 polypeptide; or pUL131 polypeptide had an increased thermostability as compared to control (Table 22).
Stacking (combining) one type of mutation (see column A of Table 23) with a second type of mutation (see column B of Table 23) was also stabilizing (Table 23). In particular, combining a cavity filling mutation within gL with disulfide bridge mutations within pUL128 and pUL130; combining disulfide bridge mutations within pUL128 and pUL130 with a deglycosylation mutation within gL; combining a repacking (hydrophilic) mutation within gH with disulfide bridge mutations within pUL128 and pUL130; combining a repacking (hydrophobic) mutation within gL with disulfide bridge mutations within pUL128 and pUL130; and combining two repacking (hydrophobic) mutations within pUL131 with a cavity filling mutation within gL were all stabilizing (Table 23).
Without wishing to be bound by theory, it is believed that these cavity filling, repacking, and disulfide bridge mutations enhance the thermostability of the HCMV pentameric complex by optimizing local interactions or decreasing conformational flexibility. It is likewise believed that these deglycosylation mutations make the HCMV pentamer complex epitopes more accessible.
Amino acid sequences written in N-terminus to C-terminus direction, nucleic acid sequences written in 5′- to 3′ direction:
MRPGLPSYLIILAVCLFSHLLSSRYGAEAVSEPLDKAFHLLLNTYGRPIRFLRENTTQCTYNSSLRNSTVVRENA
MRPGLPSYLIILAVCLFSHLLSSRYGAEAVSEPLDKAFHLLLNTYGRPIRFLRENTTQCTYNSSLRNSTVVRENA
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MSPKDLTPFLTALWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFD
MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFD
MRLCRVWLSVCLCAVVLGQCQRETAEKKRLLPSTALLGRVLSRAARPNPLQVCGTARGPHVELPLRCEPRLGQL
MGKKEMIMVKGIPKIMLLISITFLLLSLINCNVLVNSRGTRRSWPYTVLSYRGKEILKKQKEDILKRLMSTSSDG
MGRKGEMRGVFNLFFLMSLTFLLFSFINCKIAVARFRVKSQKAKEEERQLKLRILQELASKTGDYYKFFTFPSQQ
MGRKEMMVRDVPKMVFLISISFLLVSFINCKVMSKALYNRPWRGLVLSKIGKYKLDQLKLEILRQLETTISTKYN
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MRPGLPSYLIILAVCLFSHLLSSRYGAEAVSEPLDKAFHLLLNTYGRPIRFLRENTTQCTYNSSLRNSTVVRENA
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVQV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEGDKYESWLRPLVNV
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKCCNRFTVALRCPDGEVCYSPEKTAEIRG
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
I
EADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ
MSPKDLTPFLT
T
LWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRG
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFCCPFLYPSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYCSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSCPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYPSPPRSPLQFSGFQ
MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKFVEQLVDLTLNYHYDVSHGLDNFD
MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFD
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2018/000491 | 4/18/2018 | WO |
Number | Date | Country | |
---|---|---|---|
62487065 | Apr 2017 | US | |
62504059 | May 2017 | US | |
62523465 | Jun 2017 | US |